GAIR-NLP/daVinci-MagiHuman


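daVinci-MagiHuman is a single-stream Transformer foundation model for generating audio-video content. Given text, it quickly generates high-quality, expressive human videos with synchronized speech and motion, and it supports multiple languages. Its highlights are fast inference, high generation quality, and fully open-source model weights and code.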

1,585 134 1,585 24


Tech stack

Dependencies (31)

Pandas Pydantic accelerate av beautifulsoup4 boto3 debugpy depyf diffusers ffmpeg-python ftfy graphviz imageio loguru mosaicml_streaming numba packaging psycopg2-binary pydantic-settings redis redislite rich scipy sentencepiece setuptools soundfile timm torchao transformers unfoldNd versioningit
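
As a quick way to see which of the packages listed above are present in a local environment, the following is a minimal sketch using only the Python standard library. It assumes the names in the list can be used as distribution names as written, which may not hold for every entry (for example, a package may be published under a slightly different spelling). The helper `check_installed` is a hypothetical name introduced here for illustration and is not part of the project.

```python
from importlib import metadata

# Names copied verbatim from the dependency list above.
# Assumption: each name matches its installed distribution name.
DEPENDENCIES = [
    "Pandas", "Pydantic", "accelerate", "av", "beautifulsoup4", "boto3",
    "debugpy", "depyf", "diffusers", "ffmpeg-python", "ftfy", "graphviz",
    "imageio", "loguru", "mosaicml_streaming", "numba", "packaging",
    "psycopg2-binary", "pydantic-settings", "redis", "redislite", "rich",
    "scipy", "sentencepiece", "setuptools", "soundfile", "timm", "torchao",
    "transformers", "unfoldNd", "versioningit",
]


def check_installed(names):
    """Map each name to its installed version string, or None if not found."""
    status = {}
    for name in names:
        try:
            status[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            status[name] = None
    return status


if __name__ == "__main__":
    for name, version in check_installed(DEPENDENCIES).items():
        print(f"{name}: {version or 'not installed'}")
```

A package reported as not installed may simply be registered under a different distribution name, so the output is a starting point for checking an environment rather than a definitive audit.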