XiaofengLin7

🐯

Focusing

13 followers · 19 following

Boston, MA
https://xiaofenglin7.github.io/

Lists (1)

✨ Inspiration

1 repository

Stars

Memento-Teams / Memento

Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

Python 2,399 282 Updated Oct 5, 2025

safishamsi / graphify

AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, GitHub Copilot CLI, OpenClaw, Factory Droid, Trae, Google Antigravity). Turn any folder of code, docs, papers, images, o…

Python 27,764 3,025 Updated Apr 16, 2026

Imbad0202 / academic-research-skills

Academic Research Skills for Claude Code: research → write → review → revise → finalize

Python 2,903 329 Updated Apr 16, 2026

anthropics / skills

Public repository for Agent Skills

Python 118,453 13,674 Updated Apr 13, 2026

karpathy / autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 73,005 10,652 Updated Mar 26, 2026

microsoft / webgym

This project includes code for using the WebGym framework to train web agentic models.

Python 32 2 Updated Feb 25, 2026

siyan-zhao / OPSD

Python 103 8 Updated Apr 9, 2026

pUmpKin-Co / ComplementaryRL

Co-evolving policy actors and experience extractors for efficient experience-driven agent RL

Python 47 3 Updated Apr 4, 2026

microsoft / LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,344 371 Updated Apr 13, 2026

Gen-Verse / OpenClaw-RL

OpenClaw-RL: Train any agent simply by talking

Python 4,973 523 Updated Apr 16, 2026

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 16,882 1,474 Updated Apr 3, 2026

Emilianopp / Privileged-Information-Distillation-and-Self-Distillation

13 Updated Feb 24, 2026

lili-chen / rltf

Reinforcement Learning from Text Feedback

Python 31 2 Updated Feb 17, 2026

YX-S-Z / texas-holdem-arena

Python 9 1 Updated Feb 25, 2026

lasgroup / SDPO

Reinforcement Learning via Self-Distillation (SDPO)

Python 775 82 Updated Feb 18, 2026

Snowflake-Labs / agent-world-model

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Python 314 36 Updated Mar 16, 2026

khangich / machine-learning-interview

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

12,463 2,008 Updated Aug 31, 2023

Tencent-Hunyuan / CL-bench

CL-bench: A Benchmark for Context Learning

Python 505 29 Updated Feb 8, 2026

dunnolab / awesome-in-context-rl

Awesome In-Context RL: A curated list of In-Context Reinforcement Learning

288 14 Updated Sep 8, 2025

TextArena / TextArena

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python 388 90 Updated Apr 15, 2026

axon-rl / gem

A Gym for Agentic LLMs

Python 477 31 Updated Jan 21, 2026

sunblaze-ucb / omega

Python 46 4 Updated Jun 24, 2025

sail-sg / Precision-RL

Defeating the Training-Inference Mismatch via FP16

Python 189 17 Updated Nov 14, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 51,935 6,901 Updated Apr 14, 2026

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 56,725 9,704 Updated Nov 12, 2025

google-deepmind / disco_rl

Accompanying code for "Discovering State-of-the-art Reinforcement Algorithms" Nature publication

Python 685 54 Updated Dec 2, 2025

RUC-NLPIR / ARPO

[ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)

Python 957 49 Updated Apr 13, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 5,329 727 Updated Apr 16, 2026

THUDM / WebRL

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 516 37 Updated Jun 6, 2025

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 5,434 542 Updated Apr 14, 2026
