Terrain

Pre-flight checks for AI/ML systems and the tests around them. Runs locally. No API key required.

Terrain is a static analyzer that treats tests, evals, prompts, and schemas as one dependency graph — and catches the drift that crosses their boundaries before the PR merges.

Install

# macOS / Linux
brew install pmclSF/terrain/mapterrain

# npm (Node 22+ required)
npm install -g mapterrain

# Go
go install github.com/pmclSF/terrain/cmd/terrain@latest

Pre-built binaries for macOS / Linux / Windows are on the releases page. Each release is signed with Sigstore + cosign; see SECURITY-DATA-HANDLING.md for verification details.

Get started

cd your-repo
terrain analyze         # What's the state of our AI + test system?
terrain report pr       # What does this change put at risk?

Works on repos in Python, TypeScript/JavaScript, Go, Java, and Ruby. No config required. Optional artifacts (coverage data, runtime test results, eval outputs) sharpen findings when present; analysis degrades gracefully when they're absent.

What it catches

Terrain models the AI surface alongside the test surface and looks for drift across them:

Prompt-schema drift — prompts that reference fields renamed in a schema living in a different language
Hardcoded API keys — provider-shaped secrets in source (OpenAI, Anthropic, AWS, GCP, etc.)
Eval coverage gaps — AI surfaces (prompts, agents, RAG pipelines) with no scenario covering them
Model deprecations — deprecated model IDs lingering in production paths
Cross-language edges — TS/JS ↔ Python/Go/Java via OpenAPI, tRPC, gRPC, GraphQL, HTTP routes
Framework-migration blockers — Jest ↔ Vitest, JUnit 4 ↔ 5, Cypress ↔ Playwright
Untested exports + weak assertions — public API surfaces with no covering test; assertions that pass on too much
Fixture fanout + duplicate clusters + skip debt — structural problems that erode CI signal

Each finding carries a stable rule ID, severity, confidence, evidence, and documented remediation. Run terrain explain <rule-id> for the long form.

What it looks like

Terrain · Test Suite Analysis
────────────────────────────────────────────────────────────

  conftest.py fixture fans out to 3,100 tests — any change retriggers the frame/ suite.

Key Findings
────────────────────────────────────────────────────────────
  1. [HIGH] 23 source files (18%) have low structural coverage
  2. [HIGH] 8 duplicate test clusters with 0.91+ similarity
  3. [MED]  34 xfail markers older than 180 days

Risk Posture
────────────────────────────────────────────────────────────
  health:                  Moderate
  coverage_depth:          Elevated
  coverage_diversity:      Strong
  structural_risk:         Strong
  Signals:                 65 (8 high, 34 medium, 23 low)

Representative output from a large pandas-style repository. Format and labels are stable across runs; the specific numbers vary by repo. Full sample reports for analyze, insights, impact, and explain are in docs/examples/.

Workflow

Command	Question
`terrain analyze`	What is the state of our test system?
`terrain report insights`	What should we fix?
`terrain report impact`	What validations matter for this change?
`terrain report explain <target>`	Why did Terrain make this decision?

The bare forms (terrain insights, terrain impact, terrain explain) work as aliases. AI / eval verbs, framework conversion, debug drill-downs, and the slash-receiver round out the surface — full reference in docs/cli-spec.md.

CI integration

# .github/workflows/terrain.yml
name: terrain
on: pull_request

jobs:
  analyze:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v6
        with: { fetch-depth: 0 }
      - uses: actions/setup-node@v6
        with: { node-version: '22.x' }
      - run: npm install -g mapterrain
      - run: |
          terrain test \
            --junit terrain-results.xml \
            --summary "$GITHUB_STEP_SUMMARY"

terrain test is the CI-mode wrapper. The JUnit XML lets your CI render Terrain findings as test cases; setting --summary "$GITHUB_STEP_SUMMARY" makes them appear on the workflow run page automatically.

For a blocking gate, add --fail-on=high (or --fail-on=critical for the strictest threshold). For onboarding a repo with existing debt, pair with --new-findings-only --baseline <path> so only regressions block the build. To restrict the gate to AI-related changes, add a paths: filter on prompt / schema / Python / TS file globs (see docs/examples/gate/ for the full templates).

What it is not

Not a test runner. Terrain reads what pytest / jest / go test / playwright produce. It doesn't execute tests.
Not a coverage tool. Terrain ingests coverage reports if you have them; it doesn't instrument code.
Not a source-code linter. Sonar / Semgrep / CodeQL stay better-positioned for application-code bug-finding.
Not an LLM eval framework. Terrain ingests artifacts Promptfoo / DeepEval / Ragas produce. It doesn't run the evals.
Not a productivity dashboard. No leaderboards, no per-developer metrics. Ownership data routes findings, never scores people.
Not a service. Analysis runs locally with zero outbound network calls — verifiable with terrain --print-network. The install paths download signed binaries from GitHub Releases; that's the only network step.

AI-aware testing

The same dependency graph that powers test selection for application code also traces AI surface edges. A prompt-template change triggers the right eval scenarios automatically — no manual mapping.

terrain ai run --base main — run only the evals your change affects
terrain report pr — flag changed AI surfaces with no covering eval
terrain ai findings — AI eval-gap findings with evidence chains

Two generators help adopters harden a prompt before deploying it:

terrain inject --prompt prompts/main.md       # generate jailbreak-shaped test inputs
terrain scaffold --schema schemas/input.json  # generate boundary-case mutation tests

Both emit runnable pytest / vitest scaffolds you drop into your test tree. Terrain never calls the model; the assertion is yours.

MCP integration

terrain mcp exposes the last analyze run to AI coding assistants (Claude Code, Cursor, others) over the Model Context Protocol. The assistant can query findings, drill into surfaces, and read baselines without you copy-pasting JSON — so "why is this PR failing Terrain?" gets a useful answer in the IDE instead of a context-switch to the terminal.

Plugins

Third-party detectors ship as YAML manifests; terrain plugins manifest <path> validates one against the stable schema. The plugin runtime — the loader that executes registered detectors — is reserved for a future release. Adopters can author and publish manifests now and they'll be loadable when the runtime ships. See examples/plugins/example-manifest.yaml.

Documentation

Get started

Quickstart — first report in 5 minutes
CLI specification — full command + flag reference
Example reports — analyze / impact / insights / explain samples

Reference

Design — architecture, package map, signal pipeline
Severity rubric — severity labels and configuration
Compatibility — supported OSes, Go versions, frameworks, schemas
Glossary — Terrain-specific vocabulary
Versioning — what counts as a breaking change

Project

CHANGELOG — release history
Security — supported versions + vulnerability disclosure
Contributing — how to build, test, and extend Terrain

License

Apache License 2.0 — see LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 124 Commits
.github		.github
.husky		.husky
.terrain		.terrain
benchmarks		benchmarks
bin		bin
cmd		cmd
docs		docs
examples		examples
extension/vscode		extension/vscode
harness		harness
internal		internal
rfcs		rfcs
schemas		schemas
scripts		scripts
tests		tests
vscode		vscode
.editorconfig		.editorconfig
.eslintrc.json		.eslintrc.json
.gitattributes		.gitattributes
.gitignore		.gitignore
.goreleaser.yaml		.goreleaser.yaml
.npmignore		.npmignore
.npmrc		.npmrc
.nvmrc		.nvmrc
.prettierrc		.prettierrc
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
DESIGN.md		DESIGN.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
SECURITY-DATA-HANDLING.md		SECURITY-DATA-HANDLING.md
SECURITY.md		SECURITY.md
commitlint.config.cjs		commitlint.config.cjs
go.mod		go.mod
go.sum		go.sum
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Terrain

Install

Get started

What it catches

What it looks like

Workflow

CI integration

What it is not

AI-aware testing

MCP integration

Plugins

Documentation

License

About

Uh oh!

Releases 8

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Terrain

Install

Get started

What it catches

What it looks like

Workflow

CI integration

What it is not

AI-aware testing

MCP integration

Plugins

Documentation

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 8

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages