类库 › IGPO
GuoqingWang1

GuoqingWang1/IGPO

IGPO是基于信息增益的策略优化方法,专为多轮搜索智能体设计。该仓库提供核心训练代码及DR-Venus深度研究智能体的实现,支持长 horizon 任务,旨在提升智能体在复杂搜索基准测试中的性能与效率。

GuoqingWang1/IGPO

技术栈

根目录 python

框架

FastAPI Flask

网络

Requests
查看全部依赖 (52)

依赖

Jinja2 NumPy Pandas Pillow PyYAML Pydantic accelerate aiohttp beautifulsoup4 cachetools cloudpickle codetiming datasets dill einops filelock html2text huggingface_hub hydra-core mammoth markdownify modelscope omegaconf openai packaging pathvalidate pdfminer.six peft psutil puremagic pyarrow pydub pylatexenc python-pptx ray safetensors serpapi setuptools smolagents starlette sympy tensordict tomli torch torchdata tqdm transformers triton uvicorn vllm wandb youtube_transcript_api

截图

./images/Framework.png
./images/Exp.png

评论

ホーム - Wiki
Copyright © 2011-2026 iteam. Current version is 2.155.2. UTC+08:00, 2026-06-02 03:05
浙ICP备14020137号-1 $お客様$