Skip to content

[hipblaslt] tensilelite client validation threshold fix for tf32x3 and tf32x1#4890

Open
carsonbrownlee wants to merge 4 commits intodevelopfrom
users/cbrownle/tf32_accuracyfix
Open

[hipblaslt] tensilelite client validation threshold fix for tf32x3 and tf32x1#4890
carsonbrownlee wants to merge 4 commits intodevelopfrom
users/cbrownle/tf32_accuracyfix

Conversation

@carsonbrownlee
Copy link
Contributor

@carsonbrownlee carsonbrownlee commented Feb 25, 2026

Motivation

enable trig_init comparisons for tensilelite for tf32x3 and tf32x1.

Technical Details

Modify hardcoded thresholds in tensilelite validation to vary based on sqrt(k). Currently only implemented for float comparison for tf32x3 and tf32x1.

Test Plan

Tested with tensilelite yamls, modifying Tensile/Tests/common/gemm/gfx950/xfp32.yaml and ss_bss.yaml to have datainittypeA/B: 12/13. Only xfp32.yaml (tf32x3) test modified to use trig init as it makes more sense to test the lower bits by default. ss_bss.yaml not changed and needs to be modified to test.

Test Result

PASS

@carsonbrownlee carsonbrownlee changed the title tensilelite client validation threshold fix for tf32x3 and tf32x1 [hipblaslt] tensilelite client validation threshold fix for tf32x3 and tf32x1 Feb 25, 2026
@math-ci-webhook
Copy link

perfci run on commit 49fe485

math-ci run

@math-ci-webhook
Copy link

perfci run on commit 4cad8a2

math-ci run

@codecov-commenter
Copy link

codecov-commenter commented Feb 25, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.

❌ Your project status has failed because the head coverage (76.83%) is below the target coverage (80.00%). You can increase the head coverage or adjust the target coverage.

Additional details and impacted files
@@           Coverage Diff            @@
##           develop    #4890   +/-   ##
========================================
  Coverage    65.35%   65.35%           
========================================
  Files         1718     1718           
  Lines       267217   267217           
  Branches     37047    37047           
========================================
  Hits        174637   174637           
  Misses       77072    77072           
  Partials     15508    15508           
Flag Coverage Δ *Carryforward flag
hipBLAS 90.67% <ø> (ø) Carriedforward from 4cad8a2
hipBLASLt 43.55% <ø> (ø)
hipCUB 81.98% <ø> (ø) Carriedforward from 4cad8a2
hipDNN 80.74% <ø> (ø) Carriedforward from 4cad8a2
hipFFT 55.93% <ø> (ø) Carriedforward from 4cad8a2
hipRAND 76.12% <ø> (ø) Carriedforward from 4cad8a2
hipSOLVER 68.81% <ø> (ø) Carriedforward from 4cad8a2
hipSPARSE 84.70% <ø> (ø) Carriedforward from 4cad8a2
rocBLAS 47.97% <ø> (ø) Carriedforward from 4cad8a2
rocFFT 47.75% <ø> (ø) Carriedforward from 4cad8a2
rocRAND 57.06% <ø> (ø) Carriedforward from 4cad8a2
rocSOLVER 76.83% <ø> (ø) Carriedforward from 4cad8a2
rocSPARSE 71.53% <ø> (ø) Carriedforward from 4cad8a2

*This pull request uses carry forward flags. Click here to find out more.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
  • 📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants