🎯
Focusing
AI Scientist @microsoft
-
Microsoft
- Hyderabad
- https://www.kaggle.com/shashankshuklacodedl
- in/shashank-shekhar-shukla-722859227
Pinned Loading
-
grpo-composer
grpo-composer PublicThe first comprehensive toolkit for Group Relative Policy Optimization, unifying over 20 state-of-the-art variants (including TIC-GRPO, KRPO, and RankGRPO) into a modular, config-driven engine for …
Python 3
-
AMD_AI_Premierre_League
AMD_AI_Premierre_League PublicThe task is to build two competing AI agents for a 1v1 knockout tournament: a Q-agent that generates formatted puzzle-based questions on given topics, and an A-agent that must correctly answer them.
Jupyter Notebook
-
-
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

