-
Notifications
You must be signed in to change notification settings - Fork 3.5k
feat: add CLI command to download local-inference models (no goosed server required) #8282
Copy link
Copy link
Open
Description
Problem
Downloading local GGUF models for the local-inference feature currently requires:
- Starting the
goosed agentHTTP server - Knowing the secret key (randomly generated, printed to stderr as
GOOSE_SERVER__SECRET_KEYor needingGOOSE_SERVER__SECRET_KEYenv var) - Using
curlwith the correct API endpoint, headers, and URL-encoded model IDs:curl -sk -X POST -H "X-Secret-Key: $SECRET" -H "Content-Type: application/json" \ -d '{"spec":"bartowski/Llama-3.2-1B-Instruct-GGUF:Q4_K_M"}' \ https://127.0.0.1:3000/local-inference/download
- Monitoring progress via a separate GET endpoint
- Stopping the goosed server when done
This is especially painful when:
- Configuring
localprovider viagoose configure— it says "Model not found" with no hint about downloading - Setting up the toolshim interpreter —
GOOSE_TOOLSHIM=truefails with cryptic Ollama errors if no local model is downloaded - Running headless / on servers where there is no desktop UI
Proposed Solution
Add a CLI subcommand for model management:
# List available models and their download status
goose models list
# Download a model
goose models download bartowski/Llama-3.2-1B-Instruct-GGUF:Q4_K_M
# Show download progress
goose models status
# Remove a downloaded model
goose models remove <model-id>Benefits
- No need to start goosed server just to download a model
goose configureforLocal Inferencecould auto-prompt to download if model not found- Toolshim setup becomes: set env var +
goose models download— no Ollama, no goosed
Current Workaround
Start goosed with a known secret key and use the HTTP API:
$env:GOOSE_SERVER__SECRET_KEY = "localtest123"
goosed agent
# In another terminal:
curl -sk -X POST -H "X-Secret-Key: localtest123" -H "Content-Type: application/json" -d '{"spec":"bartowski/Llama-3.2-1B-Instruct-GGUF:Q4_K_M"}' https://127.0.0.1:3000/local-inference/downloadRelated
- feat: allow configuring provider for developer extension's message augmentation (hardcoded Ollama dependency) #8270 — Developer extension hardcodes Ollama for message augmentation
- bug: Toolshim OllamaInterpreter returns noop for all streaming tool call fragments #8275 — Toolshim streaming fragment handling
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels