deepspeedai / DeepSpeed Public

Pull requests: deepspeedai/DeepSpeed

121 Open 3,530 Closed

Pull requests list

[Bugfix] Validate fp16.loss_scale is finite in DeepSpeedFP16Config

#7892 opened Mar 8, 2026 by s-zx

[Bloom] Fix hangs of bloom test

#7890 opened Mar 6, 2026 by k-artem

fix: Validate fp16.loss_scale is finite and non-negative

#7889 opened Mar 6, 2026 by nathon-lee

Fix Stage 0 + Ulysses crash: make bwc_tensor_model_parallel_rank() resilient to MP API absence

#7888 opened Mar 6, 2026 by nathon-lee

[SP] add SP deny list instead of allow

#7887 opened Mar 5, 2026 by kashif

Fix Evoformer's multi-arch dispatch root cause

#7881 opened Mar 2, 2026 by tohtana

fix(zero): Ensure full gradient reduction for Muon optimizer with reduce_scatter

#7878 opened Feb 27, 2026 by nathon-lee

fix: correct DistributedAttention output shape and pad uneven sequence lengths (#7842)

#7868 opened Feb 22, 2026 by harshang03 • Draft

fix: keep fp32-pinned parameters out of the bf16 cast path in ZeRO-3 (#7747)

#7867 opened Feb 22, 2026 by harshang03 • Draft

Revert "fix: remove premature MPI environment variable check in OpenMPIRunner"

#7864 opened Feb 21, 2026 by mikloorbi-sys • Draft

Merging AutoSP into DeepSpeed

#7860 opened Feb 19, 2026 by neeldani

Fix global .cuh ignore and enforce tracked CUDA headers

#7858 opened Feb 18, 2026 by harshang03 • Draft

Fix ZeRO legacy grad-hook crash when next_functions is missing

#7857 opened Feb 17, 2026 by harshang03 • Draft

Reject non-finite fp16 loss_scale across config and ZeRO paths

#7856 opened Feb 17, 2026 by harshang03 • Draft

Fix zero/division safety gaps in utility and inference paths

#7855 opened Feb 17, 2026 by harshang03 • Draft

Fix count_used_parameters_in_backward crash on PyTorch < 2.3 (#7756)

#7849 opened Feb 12, 2026 by harshang03 • Draft

[BUG] Fix: Fix gradient norm calculation and dynamic shape blocking in PP+ZeRO1 collective communication

#7847 opened Feb 12, 2026 by Thinksky5124

Fix subgroup optimizer metadata inconsistency

#7820 opened Jan 27, 2026 by st-bang97

[Draft] Muon Optimizer Support for ZeRO3

#7798 opened Jan 20, 2026 by PKUWZP

Fix bf16 dtype mismatch in ZeRO-3 with zero_quantized_weights

#7792 opened Jan 18, 2026 by juyterman1000

Fix Muon optimizer conflict with gradient clipping in ZeRO 1/2

#7776 opened Jan 12, 2026 by fy817

Fix: ZenFlow Adam integration for updated PyTorch backward flow (#7759)

#7771 opened Jan 11, 2026 by Antlera

Fix Muon optimizer checkpoint resume with bf16 mode

#7748 opened Dec 28, 2025 by yurekami • 2 tasks done

Introduce Megatron-style parallel state management

#7726 opened Dec 15, 2025 by eternalNight • 5 tasks done

let allgather and alltoall execute in parallel when both attention and MOE used TP

#7723 opened Dec 11, 2025 by taozhiwei
