Skip to content
View ignaciocases's full-sized avatar

Block or report ignaciocases

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The world's first agentic reverse engineer.

Python 631 87 Updated Apr 3, 2026

An interface library for RL post training with environments.

Python 1,612 332 Updated Apr 14, 2026

Agentic RL Training at Scale

Python 1,292 257 Updated Apr 15, 2026

Our library for RL environments + evals

Python 4,008 528 Updated Apr 14, 2026

Reproducible, flexible LLM evaluations

Python 363 84 Updated Mar 24, 2026

Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Python 615 59 Updated Apr 6, 2026

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 78 16 Updated Aug 17, 2024

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,798 168 Updated Feb 27, 2026

PyTorch-native post-training at scale

Python 667 96 Updated Apr 14, 2026

Learn the building blocks of how to build DeepSeek from scratch.

Jupyter Notebook 122 33 Updated Apr 13, 2026

Fast C++ Pytorch extension for differentiable synthetic aperture radar image formation and autofocus library on CPU and GPU

Python 190 34 Updated Feb 8, 2026

The Core Flight System (cFS)

CMake 1,258 346 Updated Apr 14, 2026

Preempt-RT Kernel Build Guide for NVIDIA Development Board

Shell 27 6 Updated Jun 21, 2024

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 515 37 Updated Jun 6, 2025

NeXT hardware emulator for a NeXT Cube and NeXT Station. Mirrored from SourceForge

C 90 15 Updated Dec 12, 2017

AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.

Python 563 112 Updated Mar 17, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,119 8,579 Updated Apr 12, 2026

Align Anything: Training All-modality Model with Feedback

Python 4,646 506 Updated Nov 27, 2025

Train transformer language models with reinforcement learning.

Python 18,051 2,643 Updated Apr 15, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,347 917 Updated Apr 15, 2026

🌎💪 BrowserGym, a Gym environment for web task automation

Python 1,193 166 Updated Mar 17, 2026

A project that provides help for using DeepMind's mctx on gym-style environments.

Python 65 12 Updated Nov 14, 2024

[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Python 520 98 Updated Sep 6, 2024
Python 12 1 Updated Jun 29, 2021

Recources to build the MFOS - Noise Toaster Synth by Ray Wilson

HTML 15 3 Updated Mar 25, 2024

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Python 286 29 Updated May 26, 2024

A library for advanced large language model reasoning

Python 2,340 203 Updated Jun 10, 2025

An extensible benchmark for evaluating large language models on planning

PDDL 458 48 Updated Sep 17, 2025

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,568 189 Updated Apr 5, 2026

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook 10,872 1,395 Updated Mar 24, 2026
Next