Suggestion Description
Hi there - could the MLA kernel be open source? That way we can optimize it for our particular shapes.
Specifically looking at: mla_a8w8_qh128_m32x4_n16x2_msk0_ps.so
Operating System
No response
GPU
No response
ROCm Component
No response