NVIDIA Corporation

Pinned

  1. cuopt Public

    GPU-accelerated decision optimization

    Cuda · 721 stars · 127 forks

  2. cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook · 413 stars · 66 forks

  3. open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C · 16.7k stars · 1.6k forks

  4. aistore Public

    AIStore: scalable storage for AI applications

    Go · 1.8k stars · 237 forks

  5. nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go · 4.1k stars · 481 forks

  6. GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook · 3.8k stars · 986 forks

Repositories

Showing 10 of 677 repositories
  • Model-Optimizer Public

    A unified library of SOTA model optimization techniques such as quantization, pruning, distillation, and speculative decoding. It compresses deep learning models for downstream deployment frameworks such as TensorRT-LLM, TensorRT, and vLLM to optimize inference speed.

    Python · 2,056 stars · Apache-2.0 · 284 forks · 67 issues · 100 pull requests · Updated Feb 26, 2026
  • NVSentinel Public

    NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

    Go · 185 stars · Apache-2.0 · 49 forks · 45 issues · 20 pull requests · Updated Feb 26, 2026
  • bare-metal-manager-core Public

    NVIDIA Bare Metal Manager - Hardware Lifecycle Management and multitenant networking

    Rust · 65 stars · Apache-2.0 · 43 forks · 55 issues (3 need help) · 21 pull requests · Updated Feb 26, 2026
  • dsx-github-actions Public

    GitHub Actions infrastructure for DSX

    Dockerfile · 3 stars · 0 forks · 0 issues · 0 pull requests · Updated Feb 26, 2026
  • cuda-quantum Public

    C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

    C++ · 941 stars · 342 forks · 431 issues (16 need help) · 108 pull requests · Updated Feb 26, 2026
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    Python · 12,949 stars · 2,128 forks · 537 issues · 542 pull requests · Updated Feb 26, 2026
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    Python · 15,388 stars · 3,628 forks · 306 issues (1 needs help) · 324 pull requests · Updated Feb 26, 2026
  • bare-metal-manager-rest Public

    NVIDIA Bare Metal Management - Hardware Lifecycle Management (REST API)

    Go · 21 stars · Apache-2.0 · 17 forks · 5 issues · 13 pull requests · Updated Feb 26, 2026
  • cccl Public

    CUDA Core Compute Libraries

    C++ · 2,182 stars · 346 forks · 1,270 issues (6 need help) · 197 pull requests · Updated Feb 26, 2026
  • OSMO Public

    The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge devices in a simple YAML

    TypeScript · 97 stars · Apache-2.0 · 19 forks · 60 issues · 21 pull requests · Updated Feb 26, 2026