Refactoring all test scripts to pytest and fixing CUDA graphs compatibility #13

Merged: akshaysubr merged 4 commits into main from testing-refactor on Mar 6, 2026
Conversation

@akshaysubr (Collaborator)

This PR includes three major improvements to cuHPX:

  1. Test Suite Refactoring - Modernized all tests to use pytest with proper fixtures and dtype parameterization
  2. CUDA Graph Compatibility - Added support for capturing SHTCUDA/iSHTCUDA operations in CUDA graphs
  3. Bug Fix - Fixed segmentation fault caused by dangling CUDA stream references

- Convert all test scripts to pytest format with fixtures and assertions
- Add conftest.py with shared fixtures (device, nside, dtype, tolerances)
- Parameterize all SHT tests with both float32 and float64 dtypes
- Add tolerance helpers: get_impl_tol(), get_roundtrip_tol(), get_bluestein_tol()
- Add pytest configuration to pyproject.toml
- Move profiling scripts to benchmarks/ and examples/ directories
- Remove obsolete test_sht_two_stream_overlap_profiling.py
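The fixture and tolerance-helper setup described above could look roughly like the sketch below. This is a hypothetical illustration, not the actual conftest.py: dtypes are keyed by string so the sketch stays dependency-free (the real code presumably parameterizes over torch dtypes), the float32 tolerances are taken from the values quoted later in this review thread, and the float64 values and nside choices are illustrative placeholders.

```python
# Hypothetical sketch of conftest.py fixtures and tolerance helpers.
# String dtype keys keep the sketch torch-free; float64 values and
# nside choices are illustrative, not the project's actual settings.
import pytest

# (rtol, atol) per dtype and transform direction
_IMPL_TOL = {
    "float32": {"sht": (1e-4, 1e-5), "isht": (1e-3, 1e-2)},
    "float64": {"sht": (1e-10, 1e-12), "isht": (1e-9, 1e-11)},  # illustrative
}


def get_impl_tol(dtype, direction="sht"):
    """Look up (rtol, atol) for comparing two SHT implementations."""
    return _IMPL_TOL[dtype][direction]


@pytest.fixture(params=["float32", "float64"])
def dtype(request):
    """Run every test once per floating-point precision."""
    return request.param


@pytest.fixture(params=[32, 64])
def nside(request):
    """HEALPix resolution parameter (illustrative values)."""
    return request.param
```

A test would then take `dtype` and `nside` as arguments and call `get_impl_tol(dtype)` to pick its comparison thresholds, so tightening or loosening a tolerance is a one-line change in one file.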

Signed-off-by: Akshay Subramaniam <6964110+akshaysubr@users.noreply.github.com>
@akshaysubr requested review from Copilot and xiaopoc on January 22, 2026 at 01:40
Copilot AI left a comment
Pull request overview

This PR modernizes the cuHPX test suite and adds critical CUDA graph support for production workloads.

Changes:

  • Refactored all test scripts from interactive input-based scripts to pytest-based unit tests with proper fixtures and parameterization
  • Added CUDA graph compatibility by fixing stream handling and pre-allocating buffers in SHTCUDA/iSHTCUDA operations
  • Fixed segmentation fault caused by dangling CUDA stream references in HealpixFFT/HealpixIFFT classes
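The graph-capture support summarized above typically follows PyTorch's standard recipe: warm the operation up on a side stream so allocations happen before capture, then record the call into a `torch.cuda.CUDAGraph`. The sketch below shows that recipe under stated assumptions; `sht_op` and `static_input` are placeholders, not actual cuHPX names, and `torch` is imported lazily so the sketch loads on machines without a GPU.

```python
def capture_sht(sht_op, static_input, warmup=3):
    """Capture an SHT call into a CUDA graph (sketch of the usual PyTorch recipe).

    For capture to succeed, `sht_op` must avoid fresh allocations and
    dangling stream references during the recorded call -- the kind of
    fix this PR applies to the SHTCUDA/iSHTCUDA operations.
    """
    import torch  # imported lazily so the sketch loads without CUDA installed

    side = torch.cuda.Stream()
    side.wait_stream(torch.cuda.current_stream())
    with torch.cuda.stream(side):
        for _ in range(warmup):          # warm up: allocations happen pre-capture
            static_out = sht_op(static_input)
    torch.cuda.current_stream().wait_stream(side)

    graph = torch.cuda.CUDAGraph()
    with torch.cuda.graph(graph):        # record one call into the graph
        static_out = sht_op(static_input)
    # To rerun: copy new data into static_input, then graph.replay();
    # results appear in static_out.
    return graph, static_out
```

The key property is that replaying the graph reuses the exact memory recorded at capture time, which is why the PR pre-allocates buffers and pre-computes weights instead of allocating inside the operation.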

Reviewed changes

Copilot reviewed 20 out of 20 changed files in this pull request and generated 7 comments.

Summary per file:

  • tests/test_sht_two_stream_overlap_profiling.py – Removed old profiling test (replaced by benchmark script)
  • tests/test_sht_cuda_stream.py – Converted to pytest with CUDA stream validation tests
  • tests/test_sht_cuda_batch.py – Converted to pytest with batched operation tests
  • tests/test_sht_cuda.py – Converted to pytest with CUDA vs PyTorch comparison tests
  • tests/test_sht_bluestein.py – Converted to pytest with Bluestein algorithm tests
  • tests/test_regridding.py – Converted to pytest with grid regridding tests
  • tests/test_harmonic_transform.py – Converted to pytest with SHT/iSHT roundtrip tests
  • tests/test_grad.py – Converted to pytest with gradient flow tests
  • tests/test_differentiability.py – Converted to pytest with gradient-based optimization tests
  • tests/test_data_remapping.py – Converted to pytest with ring/nest remapping tests
  • tests/test_cuda_graphs.py – New file with comprehensive CUDA graph capture tests
  • tests/test_batch_remap.py – Converted to pytest with batched remapping tests
  • tests/conftest.py – New pytest configuration with shared fixtures and tolerance helpers
  • src/harmonic_transform/hpx_fft.h – Fixed stream handling and added pre-allocated buffers for CUDA graphs
  • src/harmonic_transform/hpx_fft.cpp – Updated to use pre-allocated buffers and in-place operations
  • pyproject.toml – Added pytest configuration and test dependencies
  • examples/wmap_optimization.py – New example demonstrating differentiable SHT on WMAP data
  • cuhpx/hpx_sht.py – Fixed stream handling and pre-computed weights for CUDA graph compatibility
  • benchmarks/stream_overlap_profiling.py – New benchmark for profiling stream overlap
  • benchmarks/sht_profiling.py – New benchmark for profiling SHT operations
Comments suppressed due to low confidence (1)

src/harmonic_transform/hpx_fft.h:1

  • The condition `n_ < n` only triggers reconfiguration when the new batch size is larger. If n becomes smaller (e.g., from 8 to 4), no reconfiguration occurs, and the pre-allocated buffers sized for n_ = 8 are reused. While the code uses slicing to handle this (line 77), the comment on line 133 is misleading; it should clarify that buffers are sized for the maximum batch size seen, not just 'when batch grows larger'.
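The grow-only allocation pattern the review comment describes can be illustrated with a minimal Python sketch (plain lists stand in for device allocations; the class and member names are illustrative, not the actual hpx_fft.h members):

```python
class ScratchBuffers:
    """Grow-only scratch storage: reallocate only when the batch grows.

    Mirrors the `n_ < n` check discussed above: capacity records the
    maximum batch size ever seen, and smaller batches reuse a slice
    of the existing, larger buffer.
    """

    def __init__(self):
        self.capacity = 0   # max batch size seen so far (the `n_` role)
        self.buf = None

    def ensure(self, n):
        if self.capacity < n:        # reconfigure only on growth
            self.buf = [0.0] * n     # stand-in for a device allocation
            self.capacity = n
        return self.buf[:n]          # slicing covers n < capacity
```

After `ensure(8)` followed by `ensure(4)`, the capacity stays 8, which is exactly the review's point: buffers end up sized for the maximum batch seen, not for the current one.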


@xiaopoc (Collaborator) commented Jan 23, 2026

I ran the pytest suite on my end and observed 3 test failures. The failures occur in precision checks and appear to be caused by numerical tolerance thresholds that are set too tightly. Maybe we can try the following fixes?

  1. tests/conftest.py (line 86) – Increase atol for float32 implementation comparisons
   # Current
   torch.float32: {"sht": (1e-4, 1e-5), "isht": (1e-3, 1e-2)},
   # Suggested
   torch.float32: {"sht": (1e-4, 2e-05), "isht": (1e-3, 0.1)},
  2. tests/test_sht_cuda_batch.py (line 66) – Increase atol for batch vs single comparison
   # Current
   atol = 1e-6 if dtype == torch.float32 else 1e-10
   # Suggested
   atol = 1e-4 if dtype == torch.float32 else 1e-10

@akshaysubr (Collaborator, Author)

@xiaopoc Thanks for the review and independent testing. What GPU, CUDA version, etc. were you testing with?

@xiaopoc (Collaborator) left a comment

All tests passed. The PR looks good to me, so I approved it.

@akshaysubr merged commit 1e37484 into main on Mar 6, 2026
1 check passed
3 participants