Skip to content

Pull requests: GeeeekExplorer/nano-vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix scheduler.postprocess return type
#200 opened Apr 11, 2026 by KinglittleQ Loading…
docs: use hf download in README
#194 opened Apr 2, 2026 by sablecode Loading…
fix: correct postprocess return type annotation
#192 opened Mar 30, 2026 by Desirer Loading…
Fix CUDA graph block_tables shape mismatch
#191 opened Mar 24, 2026 by ilrewrite Loading…
Feature/support llama3
#188 opened Mar 21, 2026 by wudong5 Loading…
fix: update download command for model weights in README
#185 opened Mar 12, 2026 by SYaoJun Loading…
docs: add Chinese README and language links
#183 opened Mar 8, 2026 by LJS1124 Loading…
add a Dockerfile for nano-vllm
#178 opened Mar 3, 2026 by pacoxu Loading…
[Doc]Add Repository Architecture Overview Document
#177 opened Feb 26, 2026 by CalvinXKY Loading…
Update embed_head.py
#174 opened Feb 21, 2026 by TianduoWang Loading…
enable 'slots=True' for dataclasses
#172 opened Feb 9, 2026 by IceCreamMilkyTea Loading…
fix: modify input when input is fp32
#171 opened Feb 8, 2026 by philhuan Loading…
fix(rms_norm): add copy for residual
#169 opened Jan 28, 2026 by tpoisonooo Loading…
test
#160 opened Jan 15, 2026 by volcano98 Loading…
remove hard code for block_size
#148 opened Dec 29, 2025 by guodongxiaren Loading…
bug for tensor parallelism # issue 144
#145 opened Dec 17, 2025 by LiaoMengqi Loading…
ProTip! What’s not been updated in a month: updated:<2026-03-12.