Skip to content
View XiaofengLin7's full-sized avatar
🐯
Focusing
🐯
Focusing

Block or report XiaofengLin7

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

Python 2,399 282 Updated Oct 5, 2025

AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, GitHub Copilot CLI, OpenClaw, Factory Droid, Trae, Google Antigravity). Turn any folder of code, docs, papers, images, o…

Python 27,764 3,025 Updated Apr 16, 2026

Academic Research Skills for Claude Code: research → write → review → revise → finalize

Python 2,903 329 Updated Apr 16, 2026

Public repository for Agent Skills

Python 118,453 13,674 Updated Apr 13, 2026

AI agents running research on single-GPU nanochat training automatically

Python 73,005 10,652 Updated Mar 26, 2026

This project includes code for using the WebGym framework to train web agentic models.

Python 32 2 Updated Feb 25, 2026
Python 103 8 Updated Apr 9, 2026

Co-evolving policy actors and experience extractors for efficient experience-driven agent RL

Python 47 3 Updated Apr 4, 2026

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,344 371 Updated Apr 13, 2026

OpenClaw-RL: Train any agent simply by talking

Python 4,973 523 Updated Apr 16, 2026

The absolute trainer to light up AI agents.

Python 16,882 1,474 Updated Apr 3, 2026

Reinforcement Learning from Text Feedback

Python 31 2 Updated Feb 17, 2026
Python 9 1 Updated Feb 25, 2026

Reinforcement Learning via Self-Distillation (SDPO)

Python 775 82 Updated Feb 18, 2026

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Python 314 36 Updated Mar 16, 2026

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

12,463 2,008 Updated Aug 31, 2023

CL-bench: A Benchmark for Context Learning

Python 505 29 Updated Feb 8, 2026

Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —

288 14 Updated Sep 8, 2025

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python 388 90 Updated Apr 15, 2026

A Gym for Agentic LLMs

Python 477 31 Updated Jan 21, 2026
Python 46 4 Updated Jun 24, 2025

Defeating the Training-Inference Mismatch via FP16

Python 189 17 Updated Nov 14, 2025

The best ChatGPT that $100 can buy.

Python 51,935 6,901 Updated Apr 14, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 56,725 9,704 Updated Nov 12, 2025

Accompanying code for "Discovering State-of-the-art Reinforcement Algorithms" Nature publication

Python 685 54 Updated Dec 2, 2025

[ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)

Python 957 49 Updated Apr 13, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,329 727 Updated Apr 16, 2026

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 516 37 Updated Jun 6, 2025

Democratizing Reinforcement Learning for LLMs

Python 5,434 542 Updated Apr 14, 2026
Next