redai-infra

redai-infra/Relax

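Relax is an open-source, high-performance asynchronous reinforcement learning engine focused on post-training large-scale multimodal large language models. It uses a service-oriented architecture that fully decouples training from inference, and it supports end-to-end multimodal reinforcement learning across text, image, video, and audio.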
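
To make the idea of decoupling training from inference concrete, here is a minimal sketch in plain Python `asyncio`. It is not Relax's API or implementation: `Rollout`, `rollout_worker`, `trainer`, and `run` are hypothetical names, generation is replaced by a string transform, and the "optimizer step" is just a version counter. It only illustrates the general pattern of a generation side and a training side running concurrently and communicating through a bounded queue.

```python
import asyncio
from dataclasses import dataclass


@dataclass
class Rollout:
    """One generated sample plus the policy version that produced it."""
    prompt: str
    response: str
    policy_version: int


async def rollout_worker(queue, prompts, get_version):
    """Hypothetical inference side: generates samples and enqueues them."""
    for prompt in prompts:
        await asyncio.sleep(0)  # stand-in for a real generation call
        await queue.put(Rollout(prompt, prompt.upper(), get_version()))
    await queue.put(None)  # sentinel: no more rollouts


async def trainer(queue, batch_size, state):
    """Hypothetical training side: consumes batches and bumps the version."""
    batch = []
    while True:
        item = await queue.get()
        if item is None:
            break
        batch.append(item)
        if len(batch) == batch_size:
            state["version"] += 1  # stand-in for an optimizer step
            state["batches"].append(batch)
            batch = []
    if batch:  # flush a final partial batch
        state["version"] += 1
        state["batches"].append(batch)


async def run(prompts, batch_size=2):
    """Run both sides concurrently and return the final trainer state."""
    state = {"version": 0, "batches": []}
    queue = asyncio.Queue(maxsize=4)  # bounded queue gives back-pressure
    await asyncio.gather(
        rollout_worker(queue, prompts, lambda: state["version"]),
        trainer(queue, batch_size, state),
    )
    return state


if __name__ == "__main__":
    final = asyncio.run(run(["a", "b", "c", "d", "e"]))
    print(final["version"], [len(b) for b in final["batches"]])
```

Because the two sides only share the queue, a rollout may carry a `policy_version` older than the trainer's current one. Recording that version on each sample is one common way to let an asynchronous trainer account for stale data.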


Tech stack

Root directory: javascript

Frameworks

FastAPI, Vue.js ^3.4.0

Testing

pytest

Networking

Requests

Dependencies

NumPy, Pydantic, apprise, av, blessed, blobfile, clearml, colorlog, datasets, debugpy, dool, gpustat, httpx, huggingface_hub, imageio, ipdb, librosa, loguru, markdown-it-mathjax3 ^4.3.2, math_verify, mcp, memray, omegaconf, pillow, py-spy, pylatexenc, pystack, pytest-asyncio, pyyaml, ray, ring_flash_attn, sglang-router, tensorboard, transformers, uvicorn, wandb

Dev dependencies

@types/node ^20.19.37, medium-zoom ^1.1.0, mermaid ^10.0.0, swagger-ui-dist ^5.32.1, vitepress ^1.0.0, vitepress-plugin-mermaid ^2.0.0

Screenshots

./assets/Relax.jpg
./assets/arch.png
