Releases: ROCm/rocFFT
rocFFT 1.0.36 for ROCm 7.2.0
Optimized
- Removed a potentially unnecessary global transpose operation from MPI 3D multi-GPU pencil decompositions.
- Enabled optimization of 3D pencil decompositions for single-process multi-GPU transforms.
Resolved issues
- Fixed potential division by zero when constructing plans using dimensions of length 1.
- Fixed result scaling on multi-device transforms.
- Fixed callbacks on multi-device transforms.
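As background on what "result scaling" refers to: under the usual unnormalized FFT convention, a forward transform followed by an inverse transform returns the input multiplied by the transform length N, so a scale factor of 1/N is needed to recover the original data. The sketch below demonstrates this with a naive pure-Python DFT. It does not call rocFFT, and `dft` and `round_trip` are hypothetical helpers introduced only for illustration.

```python
import cmath

def dft(x, sign):
    """Naive unnormalized DFT; sign=-1 is forward, sign=+1 is inverse."""
    n = len(x)
    return [
        sum(x[j] * cmath.exp(sign * 2j * cmath.pi * j * k / n) for j in range(n))
        for k in range(n)
    ]

def round_trip(x, scale=1.0):
    """Forward then inverse transform, multiplying the result by `scale`."""
    return [scale * v for v in dft(dft(x, -1), +1)]

signal = [1 + 0j, 2 + 0j, 3 + 0j, 4 + 0j]
unscaled = round_trip(signal)                    # input multiplied by N
scaled = round_trip(signal, 1.0 / len(signal))   # input recovered
```

A multi-device transform is expected to apply the same scale factor as the equivalent single-device transform, whichever way the data is split across devices.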
rocFFT 1.0.34 for ROCm 7.1.1
rocFFT code for ROCm 7.1.1 did not change. The library was rebuilt for the updated ROCm 7.1.1 stack.
rocFFT 1.0.34 for ROCm 7.1.0
rocFFT code for ROCm 7.1.0 did not change. The library was rebuilt for the updated ROCm 7.1.0 stack.
rocFFT 1.0.34 for ROCm 7.0.2
rocFFT code for ROCm 7.0.2 did not change. The library was rebuilt for the updated ROCm 7.0.2 stack.
rocFFT 1.0.34 for ROCm 7.0.1
rocFFT code for ROCm 7.0.1 did not change. The library was rebuilt for the updated ROCm 7.0.1 stack.
rocFFT 1.0.34 for ROCm 7.0.0
Added
- Added gfx950 support.
Removed
- Removed rocfft-rider legacy compatibility from clients.
- Removed support for the gfx940 and gfx941 targets from the client programs.
- Removed backward compatibility symlink for include directories.
Optimized
- Removed unnecessary HIP event/stream allocation and synchronization during MPI transforms.
- Implemented single-precision 1D kernels for lengths:
  - 4704
  - 5488
  - 6144
  - 6561
  - 8192
- Implemented single-kernel plans for some large 1D problem sizes on devices with at least 160 KiB of LDS.
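For context, each of the 1D lengths listed above (4704, 5488, 6144, 6561, 8192) factors entirely into the small primes 2, 3, and 7. The sketch below is plain Python that only checks this arithmetic. It does not call rocFFT, and `prime_factors` and `NEW_LENGTHS` are hypothetical names introduced for illustration.

```python
def prime_factors(n):
    """Return the prime factorization of n as a {prime: exponent} dict."""
    factors = {}
    p = 2
    while p * p <= n:
        while n % p == 0:
            factors[p] = factors.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        # Whatever remains after trial division is itself prime.
        factors[n] = factors.get(n, 0) + 1
    return factors

# Lengths copied from the release note above.
NEW_LENGTHS = [4704, 5488, 6144, 6561, 8192]
FACTORIZATIONS = {n: prime_factors(n) for n in NEW_LENGTHS}

for n, f in FACTORIZATIONS.items():
    print(n, "=", " * ".join(f"{p}^{e}" for p, e in sorted(f.items())))
```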
Resolved issues
- Fixed kernel faults on multi-device transforms that gather to a single device when the input/output bricks are not contiguous.
rocFFT 1.0.32 for ROCm 6.4.4
rocFFT code for ROCm 6.4.4 did not change. The library was rebuilt for the updated ROCm 6.4.4 stack.
rocFFT 1.0.32 for ROCm 6.4.3
rocFFT code for ROCm 6.4.3 did not change. The library was rebuilt for the updated ROCm 6.4.3 stack.
rocFFT 1.0.32 for ROCm 6.4.2
rocFFT code for ROCm 6.4.2 did not change. The library was rebuilt for the updated ROCm 6.4.2 stack.
rocFFT 1.0.32 for ROCm 6.4.1
rocFFT code for ROCm 6.4.1 did not change. The library was rebuilt for the updated ROCm 6.4.1 stack.