Travis Wangbz2023

I may be slow to respond.

Popular repositories

qylg01- Public
test Public
test02 Public

Python
A_Share_investment_Agent Public

Forked from 24mlight/A_Share_investment_Agent

Python
verl-agent Public

Forked from langfengQ/verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. It is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".

Python
MedicalGPT Public

Forked from shibing624/MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing incremental pre-training (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.

Python