Large-Model-RL-Lib
Popular repositories Loading
-
telescope
telescope PublicForked from eduardoslonski/telescope
Scalable high-performance async RL post-training framework for LLMs with real-time observability dashboard
-
AReaL
AReaL PublicForked from inclusionAI/AReaL
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Python
-
ART
ART PublicForked from OpenPipe/ART
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!
Python
-
atropos
atropos PublicForked from NousResearch/atropos
Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments
Python
-
miles
miles PublicForked from radixark/miles
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
Python
-
Repositories
- Relax Public Forked from redai-infra/Relax
An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale
Large-Model-RL-Lib/Relax’s past year of commit activity - uni-agent Public Forked from verl-project/uni-agent
A unified framework for building, running, and training general agents at scale.
Large-Model-RL-Lib/uni-agent’s past year of commit activity - miles Public Forked from radixark/miles
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
Large-Model-RL-Lib/miles’s past year of commit activity - AReaL Public Forked from inclusionAI/AReaL
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Large-Model-RL-Lib/AReaL’s past year of commit activity - OpenRLHF Public Forked from OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
Large-Model-RL-Lib/OpenRLHF’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…