Skip to content

Pull requests: sgl-project/sglang

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[PD] Add KV transfer metric fallback tests
#24446 opened May 5, 2026 by OCWC22 Loading…
deepseek_v2: route mega-MoE pre-dispatch through DeepGEMM + FP4 acts opt-in deepseek
#24444 opened May 5, 2026 by pranjalssh Contributor Loading…
3 tasks done
test(prefill-delayer): pin offline-gen throughput test to triton backend
#24442 opened May 5, 2026 by YAMY1234 Contributor Loading…
2 tasks
[Docs] Add B200, GB200, GB300 NVIDIA hardware platform support for Kimi-K2.6 documentation Improvements or additions to documentation
#24441 opened May 5, 2026 by zijiexia Contributor Draft
5 tasks
fix(req_pool): bump pool.size to match actual tensor row count after #24243 run-ci
#24439 opened May 5, 2026 by JustinTong0323 Collaborator Loading…
1 of 3 tasks
[Gemma 4] Adding MTP support
#24436 opened May 5, 2026 by kpham-sgl Collaborator Loading…
Update Qwen3-Coder docs_new NVIDIA guidance documentation Improvements or additions to documentation
#24435 opened May 5, 2026 by wenscarl Collaborator Loading…
[NemotronH] Fix expert scale weight loading
#24434 opened May 5, 2026 by chfeng-cs Loading…
5 tasks done
[diffusion] fix: fix diffusion FSDP sharding diffusion SGLang Diffusion run-ci
#24431 opened May 5, 2026 by mickqian Collaborator Draft
[model-support] Add support for Bamba
#24430 opened May 5, 2026 by ppraneth Contributor Loading…
5 tasks
Support NemotronHPuzzleForCausalLM
#24429 opened May 5, 2026 by netanel-haber Contributor Draft
[NPU] fix profiler on NPU
#24422 opened May 5, 2026 by zhaozx-cn Contributor Loading…
5 tasks done
[codex] Add diffusion performance mode defaults diffusion SGLang Diffusion documentation Improvements or additions to documentation jit-kernel
#24419 opened May 5, 2026 by mickqian Collaborator Loading…
[PD] Fix KV transfer metrics run-ci
#24416 opened May 5, 2026 by cctry Collaborator Loading…
[Fix] Fix KV transfer metrics using wrong time window
#24415 opened May 5, 2026 by yangyonggit Loading…
7 tasks done
[vlm][pixtral] support precomputed embeddings + processor output Multi-modal multi-modal language model npu
#24412 opened May 5, 2026 by 1fanwang Draft
5 tasks done
Feat/true on policy qwen moe deterministic Issues on deterministic inference/kernels documentation Improvements or additions to documentation Multi-modal multi-modal language model npu quant LLM Quantization
#24408 opened May 5, 2026 by maocheng23 Contributor Draft
5 tasks
[Test] Add unit tests for srt/layers/utils/logprob.py
#24406 opened May 5, 2026 by dsuarez01 Loading…
9 of 11 tasks
ProTip! Filter pull requests by the default branch with base:main.