> we can rewrite this loop in a much better way: https://godbolt.org/z/dexnM8Y3a _Originally posted by @fbusato in [#7346](https://github.com/NVIDIA/cccl/pull/7346/changes#r2805184382)_