Skip to content

Pull requests: deepspeedai/DeepSpeed

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix loss scaling and backward call of ZenFlow
#7793 opened Jan 18, 2026 by tohtana Loading…
Add bf16 model with fp32 grad_accum to supported configs
#7790 opened Jan 18, 2026 by tohtana Loading…
Skip empty parameters in gradient reduction
#7789 opened Jan 18, 2026 by tohtana Loading…
Fix issue with BF16 optimizer selection
#7788 opened Jan 18, 2026 by tohtana Loading…
Fix backward for pipeline engine
#7787 opened Jan 18, 2026 by tohtana Loading…
fix checkpointing/loading of z0+bf16
#7786 opened Jan 17, 2026 by tohtana Loading…
Add workflow to run full tests
#7783 opened Jan 17, 2026 by tohtana Draft
Fix Muon optimizer checkpoint resume with bf16 mode
#7748 opened Dec 28, 2025 by yurekami Loading…
2 tasks done
Add sequential allgather optimization for ZeRO-3
#7661 opened Oct 31, 2025 by aeeeeeep Loading…
Configures workflow for offline unit tests
#7512 opened Aug 24, 2025 by porfanid Loading…
Add world-size getter in Engine
#7479 opened Aug 9, 2025 by WoosungMyung Loading…
Create COMMITTERS_RESPONSIBILITY.md
#7300 opened May 21, 2025 by PKUWZP Loading…
gather output layout support for column parallel
#7181 opened Mar 28, 2025 by inkcherry Loading…
ProTip! no:milestone will show everything without a milestone.