🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.

Pythonotheragentagenticagentic-aiagentic-rl

推荐理由
README 将它定位为「🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems」,核心价值是把 README 中描述的能力做成可以直接评估的开源实现。它已经有基础社区关注但还没过度出圈,最近两周仍在维护,主要技术栈是 Python,适合作为「AI agent 工具链」的候选项目。

注意事项
license 信息不够明确,采用前要确认授权边界;社区规模还在成长,建议先用小场景试跑;README 摘要信息有限,发布前建议再人工扫一遍文档;展示前建议跑通 README quickstart,并确认部署成本、外部依赖和数据安全边界。

README 摘要

🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.

Stars
1,468
Forks
72
Last push
2026/5/8

相似项目

GoogleCloudPlatform/professional-services

Common solutions and tools developed by Google Cloud's Professional Services team. This repository and its contents are not an officially supported Google product.

GitHub
3K
stars
Pythonapache-2.0bigqueryexamplesgkegoogle-cloud-compute

推荐理由:README 将它定位为「Common solutions and tools developed by Google Cloud's Professional Services team」,核心痛点是 AI agent 接入工具/API 时的统一入口、治理和可靠调用。它有一定社区验证,同时仍保留发现潜力,最近两周仍在维护,license 清晰,主要技术栈是 Python,适合作为「同类问题选型」的候选项目。

注意事项:README 摘要信息有限,发布前建议再人工扫一遍文档;展示前建议跑通 README quickstart,并确认部署成本、外部依赖和数据安全边界。

MervinPraison/PraisonAI

PraisonAI 🦞 — Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous self-improving agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, RAG, and support for 100+ LLMs.

GitHub
7.1K
stars
Pythonmitagentsaiai-agent-frameworkai-agent-sdk

推荐理由:README 将它定位为「PraisonAI 🦞 — Hire a 24/7 AI Workforce」,核心痛点是把业务文档和知识库变成可检索、可追问的 RAG 系统。它有一定社区验证,同时仍保留发现潜力,最近两周仍在维护,license 清晰,主要技术栈是 Python,适合作为「AI agent 工具链」的候选项目。

注意事项:README 摘要信息有限,发布前建议再人工扫一遍文档;展示前建议跑通 README quickstart,并确认部署成本、外部依赖和数据安全边界。