Popular repositories Loading
-
ms-swift
ms-swift PublicForked from modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
Python
-
Megatron-LM
Megatron-LM PublicForked from NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Python
-
mergekit
mergekit PublicForked from arcee-ai/mergekit
Tools for merging pretrained large language models.
Python
-
SDPO
SDPO PublicForked from lasgroup/SDPO
Reinforcement Learning via Self-Distillation (SDPO)
Python
-
verl
verl PublicForked from verl-project/verl
verl: Volcano Engine Reinforcement Learning for LLMs
Python
If the problem persists, check the GitHub status page or contact support.

