Run Ollama on a Google Colab GPU and access it from your local machine via a Cloudflare Tunnel — no account or configuration required.
This is useful when models run too slowly on your local machine, or when you need GPU-accelerated inference for synthetic data generation in large batches.
Run directly in a Colab cell with uvx — no install step needed:
!uvx collab-ollama

Or with a specific model:
!uvx collab-ollama -m gemma:2b

If you prefer a traditional install:
!pip install collab-ollama
!collab-ollama

By default, phi3:mini is pulled and served. Use the -m / --model flag to choose a different model:
!uvx collab-ollama --model llama3:8b
!uvx collab-ollama -m gemma:2b

Once setup is complete, you'll see output like:
Setup is complete!
Base URL : https://xxxx-xxxxx-xxxxx-xxxxx.trycloudflare.com/v1/
API Key : No key required — leave it blank or use any string
Model : gemma:2b
Use the printed Base URL and Model with any OpenAI-compatible client. No API key is needed — leave it blank or pass any arbitrary string.
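Before wiring up a client, you can sanity-check the tunnel from your local machine with a plain HTTP request. Here is a minimal sketch, assuming the requests package is available and using the placeholder URL from the output above:

```python
# Sanity check: ask the tunnel for the list of models Ollama is serving.
# Replace the placeholder with the Base URL printed by collab-ollama.
import requests

base_url = "https://xxxx-xxxxx-xxxxx-xxxxx.trycloudflare.com/v1/"
models = requests.get(base_url + "models", timeout=30).json()
print(models)
```

If the tunnel is up, this should return a JSON payload that lists the pulled model.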
On your local machine, set OLLAMA_HOST to the base URL (without /v1/) and use the Ollama CLI as usual. Inference runs on the Colab GPU, but the experience feels local. Make sure you have the Ollama CLI installed locally.
export OLLAMA_HOST='https://xxxx-xxxxx-xxxxx-xxxxx.trycloudflare.com'
ollama run gemma:2b --verbose

You can pull and run any model that fits in the Colab GPU memory:
ollama pull llama3:8b
ollama run llama3:8b
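If you'd rather script against the native Ollama API than use the CLI, the official ollama Python package accepts a custom host. Here is a minimal sketch, assuming the package is installed locally (pip install ollama) and using the placeholder tunnel URL:

```python
# Point the official ollama Python client at the Cloudflare tunnel
# instead of localhost. The URL below is a placeholder.
from ollama import Client

client = Client(host="https://xxxx-xxxxx-xxxxx-xxxxx.trycloudflare.com")

response = client.chat(
    model="gemma:2b",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response["message"]["content"])
```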
Ollama exposes an OpenAI-compatible API. Install the SDK and use the Base URL directly:

pip install openai

from openai import OpenAI
client = OpenAI(
base_url="https://xxxx-xxxxx-xxxxx-xxxxx.trycloudflare.com/v1/",
api_key="ollama", # any string works, or leave blank
)
response = client.chat.completions.create(
model="gemma:2b",
messages=[
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "Hello!"},
],
)
print(response.choices[0].message.content)
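For the batch synthetic data use case mentioned at the top, generation is just a loop over prompts with the same client. Here is a rough sketch that reuses the client defined above; the prompts and output filename are placeholders:

```python
# Generate a small batch of synthetic examples and write them as JSONL.
# Prompts and output path are illustrative placeholders.
import json

prompts = [
    "Write a short question about photosynthesis, then answer it.",
    "Write a short question about the French Revolution, then answer it.",
]

with open("synthetic_data.jsonl", "w") as f:
    for prompt in prompts:
        response = client.chat.completions.create(
            model="gemma:2b",
            messages=[{"role": "user", "content": prompt}],
        )
        record = {"prompt": prompt, "completion": response.choices[0].message.content}
        f.write(json.dumps(record) + "\n")
```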
Install the SDK:

npm install openai

import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://xxxx-xxxxx-xxxxx-xxxxx.trycloudflare.com/v1/",
apiKey: "ollama", // any string works, or leave blank
});
const response = await client.chat.completions.create({
model: "gemma:2b",
messages: [
{ role: "system", content: "You are a helpful assistant." },
{ role: "user", content: "Hello!" },
],
});
console.log(response.choices[0].message.content);- Installs Ollama if not already present.
- Installs Cloudflared if not already present.
- Starts ollama serve with OLLAMA_ORIGINS=* for broad CORS support.
- Pulls the specified model (default phi3:mini).
- Opens a Cloudflare quick tunnel to localhost:11434 and prints the Base URL, API Key info, and Model name.
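For reference, those steps are roughly what you would run by hand. The sketch below is only an illustration of that sequence, not the package's actual implementation, and it assumes ollama and cloudflared are already on the PATH:

```python
# Rough manual equivalent of what collab-ollama automates (illustrative only).
import os
import subprocess

env = {**os.environ, "OLLAMA_ORIGINS": "*"}  # allow requests from any origin

# Start the Ollama server in the background with broad CORS support.
server = subprocess.Popen(["ollama", "serve"], env=env)

# Pull the model to serve (phi3:mini is the default).
subprocess.run(["ollama", "pull", "phi3:mini"], check=True)

# Open a Cloudflare quick tunnel to the local Ollama port; the public
# URL appears in cloudflared's log output.
tunnel = subprocess.Popen(["cloudflared", "tunnel", "--url", "http://localhost:11434"])
```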
- Colab: A Google Colab notebook with a GPU runtime.
- Local machine: Ollama CLI (for CLI usage), Python with openai, or Node.js with openai.