NVIDIA Metropolis VSS + Twelve Labs

Standalone integration of NVIDIA Metropolis Video Search and Summarization (VSS) with Twelve Labs for video understanding.

Provides:

Video chunking with FFmpeg and upload to NVIDIA VSS
Marengo 3.0 embeddings for semantic search and kNN anomaly detection
Pegasus 1.2 for natural language video Q&A
Async upload pipeline with parallel chunk handling

This integration is used in the Sentinel video anomaly detection platform.

Requirements

Python 3.11+
uv
FFmpeg (for video chunking)
NVIDIA GPU (for running VSS locally) or a remote VSS endpoint
Twelve Labs API key

Setup

git clone https://github.com/qdrant/twelvelabs-nvidia-vss
cd twelvelabs-nvidia-vss
uv sync

cp .env.example .env
# Edit .env with your Twelve Labs API key and VSS URL

NVIDIA VSS

To run VSS locally (requires NVIDIA GPU and NGC access):

docker compose up

This starts the VSS server on port 8080 with Twelve Labs configured as the VLM backend.

For managed VSS deployment on Vultr Cloud GPUs (A100, H100, H200, L40S), see the Vultr documentation.

Usage

Ingest a video

# Full pipeline: chunk + VSS upload + Twelve Labs indexing
uv run python scripts/ingest.py --video path/to/video.mp4

# Skip VSS, only index to Twelve Labs
uv run python scripts/ingest.py --video path/to/video.mp4 --skip-vss

# Index only for Marengo (embeddings/search), skip Pegasus
uv run python scripts/ingest.py --video path/to/video.mp4 --index-type marengo

Search indexed videos

uv run python scripts/search.py --query "person running near entrance" --max-results 5

Ask questions about a video

uv run python scripts/analyze.py \
    --video-id <pegasus_video_id> \
    --prompt "What is happening in this video? Are there any unusual events?"

Library usage

import asyncio
from src import vss_client, twelvelabs_client

# Check VSS health
health = asyncio.run(vss_client.health())
print(health)

# Upload a video to Twelve Labs
result = twelvelabs_client.upload_video("clip.mp4", index_type="marengo")
video_id = result["marengo_video_id"]

# Get the embedding vector (1024-dimensional for Marengo 3.0)
embedding = twelvelabs_client.get_video_embedding(video_id)

# Semantic search
results = twelvelabs_client.search_videos("fighting or aggressive behavior", max_results=10)
for r in results:
    print(f"{r.video_id}: score={r.score:.4f}, {r.start:.1f}s-{r.end:.1f}s")

# Video Q&A
analysis = twelvelabs_client.analyze_video(video_id, "Describe what happens in this clip.")
print(analysis.text)

Architecture

video.mp4
    |
    v
chunk_video()          # FFmpeg segment muxer -> N x ~30s chunks
    |
    +---> VSS          # NVIDIA Metropolis: VLM captioning, Graph-RAG, CV pipeline
    |
    +---> Twelve Labs  # Marengo: embedding + semantic search
                       # Pegasus: natural language Q&A

The Marengo embedding (1024-dim) can be indexed into Qdrant for kNN-based anomaly detection. See qdrant/video-anomaly-edge for a complete production implementation.

Credits

Built on top of original work by Nathan Chess (nathanchess/twelvelabs-nvidia-vss-sample) and James Le (james-le-twelve-labs/nvidia-vss).

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
scripts		scripts
src		src
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NVIDIA Metropolis VSS + Twelve Labs

Requirements

Setup

NVIDIA VSS

Usage

Ingest a video

Search indexed videos

Ask questions about a video

Library usage

Architecture

Credits

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

NVIDIA Metropolis VSS + Twelve Labs

Requirements

Setup

NVIDIA VSS

Usage

Ingest a video

Search indexed videos

Ask questions about a video

Library usage

Architecture

Credits

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages