-
Notifications
You must be signed in to change notification settings - Fork 4k
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Support pre-generating and using expected checksums
#16730
opened Jan 8, 2026 by
fzyzcjy
Loading…
5 tasks
Adapt cann 8.5: use sfa and lightning indexer op from cann
npu
#16728
opened Jan 8, 2026 by
randgun
Loading…
5 tasks
[Fix] Apply --dist-timeout to GroupCoordinator process groups
#16726
opened Jan 8, 2026 by
luoyuyan
Loading…
5 tasks
support smem in per_token_quant_fp8 kernel
quant
LLM Quantization
sgl-kernel
#16725
opened Jan 8, 2026 by
zhangxin81
Loading…
5 tasks
[Rework] Add SwapAB Optimization for triton fused_moe_kernel on SM90.
run-ci
#16723
opened Jan 8, 2026 by
Insideyyy
Loading…
1 of 5 tasks
[diffusion] model: wan tp+usp optimize
diffusion
SGLang Diffusion
run-ci
#16720
opened Jan 8, 2026 by
triple-mu
Loading…
5 tasks
[Diffusion] model: fix zimage tp
diffusion
SGLang Diffusion
run-ci
#16719
opened Jan 8, 2026 by
CPFLAME
Loading…
Skip causal_conv1d test with padded batches due to Triton kernel bug
#16715
opened Jan 8, 2026 by
alisonshao
Loading…
Add force-include-usage Support for stream
#16711
opened Jan 8, 2026 by
syd520zy
Loading…
2 of 5 tasks
Fix AMD CI suite names in pr-test-amd.yml
amd
run-ci
#16710
opened Jan 8, 2026 by
alisonshao
Loading…
[smg][ci] migrate chat completions tests to new infrastructure and build wheel once and share via artifact
model-gateway
run-ci
#16709
opened Jan 8, 2026 by
slin1237
Loading…
5 tasks
[Rocm][Feat] Accelerate VisionAttention by precompute H2D part in every vis…
Multi-modal
multi-modal language model
#16705
opened Jan 8, 2026 by
ZLkanyo009
Loading…
[diffusion] feat: support decode with large num frames
diffusion
SGLang Diffusion
#16704
opened Jan 8, 2026 by
zcnrex
Loading…
5 tasks
[PCG]Add print tensor op for debug
piecewise-cuda-graph
#16702
opened Jan 8, 2026 by
Chen-0210
Loading…
1 task done
qwen3_vl encoder support graph
Multi-modal
multi-modal language model
npu
quant
LLM Quantization
#16700
opened Jan 8, 2026 by
chenxu214
Loading…
5 tasks
feat(ascend): add runtime support for GPTQ-quantized MoE models + AutoRound-quantized dense and MoE models
#16699
opened Jan 8, 2026 by
GuoYechang
Loading…
fix(function_call): group batch decode by options instead of fallback
#16698
opened Jan 8, 2026 by
ruokee
Loading…
1 of 5 tasks
Add some more metrics
documentation
Improvements or additions to documentation
#16694
opened Jan 8, 2026 by
vincentzed
•
Draft
5 tasks
fix: auto-append <|Assistant|> when last message is not user/dev
#16689
opened Jan 8, 2026 by
joeyhacker
Loading…
[Quantization][NPU] support gptq moe layer on npu
#16688
opened Jan 8, 2026 by
22dimensions
•
Draft
5 tasks
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.