类库 › MOSS-TTS-Nano
OpenMOSS

OpenMOSS/MOSS-TTS-Nano

MOSS-TTS-Nano是一个开源的轻量级多语言语音生成模型,仅0.1B参数,专为实时语音合成设计。它无需GPU即可在CPU上运行,部署简单,适用于本地演示、Web服务和轻量级产品集成。

3,018 384 3,018 44
在 GitHub 上查看
OpenMOSS/MOSS-TTS-Nano

技术栈

根目录 python

框架

FastAPI
查看全部依赖 (9)

依赖

NumPy WeTextProcessing python-multipart sentencepiece soundfile torch torchaudio transformers uvicorn

截图

./assets/images/moss_tts_2_requirements_gathering.jpg
./assets/images/concept.png
./assets/images/arch_moss_tts_nano.png
./assets/images/arch_moss_audio_tokenizer_nano.png
assets/images/evaluation_table_moss_audio_tokenizer_nano.png
assets/images/evaluation_fig_moss_audio_tokenizer.png
./assets/images/moss_tts_family.jpeg

评论

- 위키
Copyright © 2011-2026 iteam. Current version is 2.155.2. UTC+08:00, 2026-05-17 05:17
浙ICP备14020137号-1 $방문자$