Skip to content

Deferred GPU-resident sampling and pre-allocated decode tensors#779

Open
mitiskuma wants to merge 1 commit intomlc-ai:mainfrom
mitiskuma:Deferred-GPU-resident-sampling-and-pre-allocated-decode-tensors
Open

Deferred GPU-resident sampling and pre-allocated decode tensors#779
mitiskuma wants to merge 1 commit intomlc-ai:mainfrom
mitiskuma:Deferred-GPU-resident-sampling-and-pre-allocated-decode-tensors

Commits

Commits on Mar 4, 2026