Pull requests: InternLM/lmdeploy
[Feature] Add guided decoding support for speculative decoding
#4559
opened Apr 28, 2026 by
windreamer
Collaborator
4 tasks done
Update turbomind modeling infrastructure
improvement
#4557
opened Apr 27, 2026 by
lzhangzz
Collaborator
Fix cache sizing and cache block layout edge cases
Bug:P1
#4552
opened Apr 23, 2026 by
grimoire
Collaborator
[Feature] Implement /v1/embeddings endpoint for OpenAI-compatible API
enhancement
New feature or request
#4550
opened Apr 23, 2026 by
ZhijunLStudio
Contributor
2 of 4 tasks
feat: add Anthropic-compatible serving endpoints
enhancement
New feature or request
#4538
opened Apr 19, 2026 by
lvhan028
Collaborator
Test: update video sleep/wakeup and abort scenarios
#4528
opened Apr 15, 2026 by
littlegy
Contributor
style: add autopep8 pre-commit hook and apply PEP 8 formatting fixes
#4524
opened Apr 14, 2026 by
windreamer
Collaborator
add explicit trust_remote_code controls to resolve the security issue
improvement
#4511
opened Apr 8, 2026 by
lvhan028
Collaborator
make fp8 model quantized by llm-compressor can be inferenced in turbomind
enhancement
New feature or request
#4509
opened Apr 8, 2026 by
43758726
Collaborator
support more message item types
improvement
#4501
opened Apr 7, 2026 by
CUHKSZzxy
Collaborator
Integrate deep-ep nccl backend
enhancement
New feature or request
#4477
opened Mar 27, 2026 by
irexyc
Collaborator
feat: Turbomind linear gdn prefix caching
enhancement
New feature or request
#4465
opened Mar 25, 2026 by
lapy
Contributor
feat: implement Turbomind vision encoder support for Qwen3VL/3.5 families
enhancement
New feature or request
#4460
opened Mar 24, 2026 by
lapy
Contributor
[Feature] Support n parameter in /v1/chat/completions and /v1/completions
improvement
#4419
opened Mar 17, 2026 by
ziyangliu-666
Add model deployment best practice section in user guide
documentation
Improvements or additions to documentation
Fix Structured Output for GPT-OSS Models
#4386
opened Mar 2, 2026 by
windreamer
Collaborator