I have tried a lot of options and version in order to build it, but...
It's simply a pain to build it regardless of which version of CUDA you have
I you want versatility for any platform - Don't use mamba, looks at mambaout or other project, because compiling mamba it's a real PAIN
99% of issues - people can't install it, people can't build it, people can't use it
I am using arch linux
I have downgraded my cuda from 13.1 to 12.8
I have switched to python 3.10
I have switched gcc compiler to 14
I have downgraded my pytoch to 2.4
I have tried different build options, nothing helps and nothing works