verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,798 168 Updated Feb 27, 2026

meta-pytorch / torchforge

PyTorch-native post-training at scale

Python 667 96 Updated Apr 14, 2026

VizuaraAILabs / DeepSeek-From-Scratch

Learn the building blocks of how to build DeepSeek from scratch.

Jupyter Notebook 122 33 Updated Apr 13, 2026

Ttl / torchbp

Fast C++ Pytorch extension for differentiable synthetic aperture radar image formation and autofocus library on CPU and GPU

Python 190 34 Updated Feb 8, 2026

nasa / cFS

The Core Flight System (cFS)

CMake 1,258 346 Updated Apr 14, 2026

hmxf / RTJetson

Preempt-RT Kernel Build Guide for NVIDIA Development Board

Shell 27 6 Updated Jun 21, 2024

THUDM / WebRL

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 515 37 Updated Jun 6, 2025

probonopd / previous

NeXT hardware emulator for a NeXT Cube and NeXT Station. Mirrored from SourceForge

C 90 15 Updated Dec 12, 2017

ServiceNow / AgentLab

AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.

Python 563 112 Updated Mar 17, 2026

hiyouga / LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,119 8,579 Updated Apr 12, 2026

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Python 4,646 506 Updated Nov 27, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 18,051 2,643 Updated Apr 15, 2026

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,347 917 Updated Apr 15, 2026

ServiceNow / BrowserGym

🌎💪 BrowserGym, a Gym environment for web task automation

Python 1,193 166 Updated Mar 17, 2026

bwfbowen / muax

A project that provides help for using DeepMind's mctx on gym-style environments.

Python 65 12 Updated Nov 14, 2024

princeton-nlp / WebShop

[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Python 520 98 Updated Sep 6, 2024

chrisgrimm / muzero

Python 12 1 Updated Jun 29, 2021

Tom-Obvious / MFOS-NoiseToaster

Forked from samzeter/noise-toaster

Recources to build the MFOS - Noise Toaster Synth by Ray Wilson

HTML 15 3 Updated Mar 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ignacio Cases ignaciocases

Achievements

Achievements

Block or report ignaciocases

Stars

amruth-sn / kong

meta-pytorch / OpenEnv

PrimeIntellect-ai / prime-rl

PrimeIntellect-ai / verifiers

allenai / olmes

rlresearch / dr-tulu

hamishivi / EasyLM

langfengQ / verl-agent