I may be slow to respond.
Popular repositories
A_Share_investment_Agent (Public, forked from 24mlight/A_Share_investment_Agent)
Python
-
verl-agent (Public, forked from langfengQ/verl-agent)
verl-agent is an extension of veRL designed for training LLM/VLM agents via RL. It is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".
Python
-
MedicalGPT (Public, forked from shibing624/MedicalGPT)
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing incremental pre-training (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.
Python