类库 › MOSS-TTS-Nano
OpenMOSS

OpenMOSS/MOSS-TTS-Nano

MOSS-TTS-Nano是一个开源的轻量级多语言语音生成模型,仅0.1B参数,专为实时语音合成设计。它无需GPU即可在CPU上运行,部署简单,适用于本地演示、Web服务和轻量级产品集成。

2,098 281 2,098 31
在 GitHub 上查看
OpenMOSS/MOSS-TTS-Nano

技术栈

根目录 python

框架

FastAPI
查看全部依赖 (9)

依赖

NumPy WeTextProcessing python-multipart sentencepiece soundfile torch torchaudio transformers uvicorn

截图

./assets/images/concept.png
./assets/images/arch_moss_tts_nano.png
./assets/images/arch_moss_audio_tokenizer_nano.png
./assets/images/moss_tts_family.jpeg

评论

Home - Wiki
Copyright © 2011-2026 iteam. Current version is 2.155.1. UTC+08:00, 2026-04-25 19:01
浙ICP备14020137号-1 $Map of visitor$