Tianyu Xiong tianyuxbear

🎯

Focusing

Life is a battlefield of the mind.

9 followers · 21 following

NVIDIA
Shanghai, China
UTC +08:00

Pinned

TensorRT-LLM Public

Forked from NVIDIA/TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python

cutlass Public

Forked from NVIDIA/cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++

cuda-kernels Public

A collection of high-performance CUDA kernels and experiments for learning and optimizing GPU compute primitives.

Cuda 1

matmul-cpu Public

High-performance CPU GEMM kernels (C = A·Bᵀ + C) optimized for LLM inference, featuring AVX2/AVX-512 SIMD and multi-threading. Benchmarked against OpenBLAS.

C++ 1

mini-sglang Public

Forked from sgl-project/mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python

nano-vllm Public

Forked from GeeeekExplorer/nano-vllm

Nano vLLM

Python