Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[None][infra] Waive failed A10-PyTorch-1 test in pre-merge
#12207 opened Mar 13, 2026 by yuanjingx87 Loading…
1 task done
[TRTLLM-11357][feat] Support interleaved thinking for trtllm-serve
#12199 opened Mar 13, 2026 by JunyiXu-nv Loading…
1 task done
[https://nvbugs/5948878][fix] fix lost requests
#12197 opened Mar 13, 2026 by bo-nv Loading…
1 task
[None][doc] Blog18 for NVLinkOneSided AlltoAll.
#12195 opened Mar 13, 2026 by bobboli Loading…
1 task
[None][chore] Alltoall benchmark script refine (second time).
#12192 opened Mar 13, 2026 by bobboli Loading…
1 task
[TRTLLM-11267][feat] Add audio support for nemotron
#12191 opened Mar 13, 2026 by 2ez4bz Loading…
1 task done
[None][fix] Fix W4A16 AWQ bias not applied on SM100 (Blackwell)
#12190 opened Mar 13, 2026 by Tracin Loading…
1 task done
[None][fix] Add more models to increase perf test coverage
#12184 opened Mar 13, 2026 by chenfeiz0326 Loading…
1 task done
[https://nvbugs/5879588][fix] fix MiniMax model loading bugs
#12182 opened Mar 13, 2026 by jmydurant Loading…
1 task
[None][fix] fix mooncake dynamic load in transfer_agent_binding
#12181 opened Mar 13, 2026 by chuangz0 Loading…
1 task done
ProTip! no:milestone will show everything without a milestone.