Skip to content

Pull requests: InternLM/lmdeploy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Support MiniMax-M2 in TurboMind engine
#4343 opened Feb 10, 2026 by zh-nj Loading…
fix qwen3-vl-moe long context
#4342 opened Feb 9, 2026 by grimoire Loading…
fa3 check Bug:P1
#4340 opened Feb 9, 2026 by grimoire Loading…
Fix time series preprocess Bug:P1
#4339 opened Feb 9, 2026 by CUHKSZzxy Loading…
Fix authorization
#4338 opened Feb 9, 2026 by lvhan028 Loading…
[WIP]Support torch compile
#4336 opened Feb 8, 2026 by grimoire Draft
Qwen Dense/Moe model fp8 quant online
#4324 opened Feb 5, 2026 by 43758726 Loading…
return BadRequest for all invlid inputs Bug:P2
#4291 opened Jan 26, 2026 by lvhan028 Loading…
support repetition ngram logits processor
#4288 opened Jan 23, 2026 by grimoire Loading…
fix dllm mask on set_step
#4278 opened Jan 18, 2026 by grimoire Loading…
[ascend] fix awq and smoothq
#4277 opened Jan 16, 2026 by wanfengcxz Draft
test: add mixing guided and non-guided tests
#4267 opened Jan 12, 2026 by windreamer Loading…
Update benchmark serving script for proxy_server
#4173 opened Dec 1, 2025 by lvhan028 Loading…
Update installation.md
#4095 opened Nov 3, 2025 by krescent Loading…
Add step_map to track token decoding order in DLLM
#4057 opened Oct 21, 2025 by Auraithm Loading…
4 tasks done
[POC] Encoder Disaggregation
#4047 opened Oct 17, 2025 by CUHKSZzxy Draft
2 of 7 tasks
quant blocked fp8 enhancement New feature or request
#4018 opened Sep 29, 2025 by CUHKSZzxy Loading…
4 of 5 tasks
ProTip! no:milestone will show everything without a milestone.