zhoutong-hai

Popular repositories

ms-swift Public

Forked from modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python

Megatron-LM Public

Forked from NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

Python

mergekit Public

Forked from arcee-ai/mergekit

Tools for merging pretrained large language models.

Python

SDPO Public

Forked from lasgroup/SDPO

Reinforcement Learning via Self-Distillation (SDPO)

Python

verl Public

Forked from verl-project/verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python