GitHub - AgentWorkforce/burn: The first step to burning less tokens is understanding how many you burn to begin with

Understand how you're spending tokens in agent CLIs. Burn ingests Claude Code, Codex, and OpenCode session logs into a local ledger, then shows cost by model, provider, tool, file, workflow, agent, session, and overhead file.

Quick Start

npm i -g relayburn
burn summary

Burn stores data under ~/.agentworkforce/burn/ by default. Set RELAYBURN_HOME to use a different location.

Commands

Command	Use it to
`burn summary`	See total usage and cost by model or provider.
`burn hotspots`	Find expensive files, commands, and subagents.
`burn overhead`	Attribute cached prompt cost to `CLAUDE.md`, `.claude/CLAUDE.md`, and `AGENTS.md`.
`burn compare`	Compare observed model performance by activity: cost per turn, one-shot rate, and sample size.
`burn ingest`	Import existing or live session logs without wrapping the harness.
`burn mcp-server`	Expose read-only cost queries to an agent through stdio MCP.
`burn state`	Inspect, rebuild, and prune derived ledger artifacts.

`burn summary`

Use burn summary when you want the fast answer: how many turns ran, how many tokens they used, and what they cost.

Option	What it does
`--since <range>`	Limit to a relative range like `24h`, `7d`, or `4w`, or an ISO timestamp.
`--project <path>`	Limit to a project path or git-canonical project key.
`--session <id>`	Limit to one session.
`--tag k=v`	Filter by folded enrichment tag. Repeatable; all tags must match.
`--group-by-tag <key>`	Group totals by a folded enrichment tag value.
`--by-provider`	Group totals by provider instead of model.
`--json`	Emit machine-readable output.

Example	Result
`burn summary`	All-time cost by model.
`burn summary --since 24h`	Cost from the last 24 hours.
`burn summary --by-provider`	Cost grouped by effective provider.
`burn summary --tag persona=code-reviewer`	Cost for sessions stamped with that persona tag.
`burn summary --group-by-tag persona`	Cost grouped by persona value.
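
The `--tag` rule above (repeatable, all tags must match) amounts to a logical AND over key/value pairs. Here is a minimal TypeScript sketch of that rule, assuming a session's folded tags are available as a plain string map. `parseTagArg` and `matchesAllTags` are hypothetical helper names used for illustration, not part of burn's API.

```typescript
// Hypothetical helpers illustrating `--tag k=v` semantics; not burn's actual code.
type Tags = Record<string, string>;

// Split "key=value" at the first "=" so values may themselves contain "=".
function parseTagArg(arg: string): [string, string] {
  const idx = arg.indexOf("=");
  if (idx <= 0) throw new Error(`expected k=v, got "${arg}"`);
  return [arg.slice(0, idx), arg.slice(idx + 1)];
}

// A session matches only if every requested tag is present with the same value.
function matchesAllTags(sessionTags: Tags, filters: string[]): boolean {
  return filters.map(parseTagArg).every(([key, value]) => sessionTags[key] === value);
}
```

With no filters the check passes trivially, which mirrors running `burn summary` without `--tag`.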

Synthetic-routed models are recognized by model IDs matching hf:*, accounts/fireworks/models/*, or synthetic/*.
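
The three patterns above read as simple prefix globs. A minimal sketch of that check follows. The function name is hypothetical and burn's real matcher may be stricter.

```typescript
// Prefixes taken from the sentence above; `looksSyntheticRouted` is a
// hypothetical name, not a burn API.
const SYNTHETIC_PREFIXES = ["hf:", "accounts/fireworks/models/", "synthetic/"];

function looksSyntheticRouted(modelId: string): boolean {
  return SYNTHETIC_PREFIXES.some((prefix) => modelId.startsWith(prefix));
}
```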

`burn hotspots`

Use burn hotspots when you want to know what made a session or time window expensive. The default view attributes spend to files, bash commands, and subagents.

Option	What it does
`--since <range>`	Limit to a relative range or ISO timestamp.
`--project <path>`	Limit to a project.
`--session <id>`	Limit to a single session id.
`--workflow <id>`	Limit to turns folded under a `workflowId` enrichment stamp.
`--provider <csv>`	Restrict to providers (case-insensitive CSV — e.g. `anthropic,openai`).
`--all`	Show every row instead of the top 10.
`--group-by <dim>`	Focus one rollup: `attribution`, `bash`, `bash-verb`, `file`, or `subagent`.
`--patterns [csv]`	Run waste-pattern detectors instead of the attribution view. Pass without a value to enable every detector, or pass a CSV (e.g. `retry-loop,failure-run`).
`--findings`	Emit the unified findings table instead of the per-detector grouping. Implies `--patterns` if not already set.
`--json`	Emit machine-readable output.

Example	Result
`burn hotspots --since 7d`	Top costly files, bash commands, and subagents for the week.
`burn hotspots --all --project .`	Full project hotspot list.
`burn hotspots --group-by bash-verb --since 7d`	Bash verbs ranked by cost.
`burn hotspots --session <uuid>`	Restrict the standard attribution view to one session.
`burn hotspots --patterns retry-loop,failure-run`	Surface retry/failure waste-pattern findings only.
`burn hotspots --findings --since 7d`	Unified severity-ranked findings list across every detector.
`burn hotspots --provider anthropic`	Restrict attribution to Anthropic-served turns.
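
Several commands accept `--since <range>` as either a relative range or an ISO timestamp. The sketch below shows one way such a value could resolve to a cutoff time. It handles only the units that appear in this README's examples (`h`, `d`, `w`). Whether burn accepts other units is not stated, and `resolveSince` is a hypothetical name.

```typescript
// Sketch: resolve a `--since` value to a cutoff Date.
// Only h/d/w are handled because those are the units shown in the examples.
function unitMs(unit: string): number {
  switch (unit) {
    case "h": return 3_600_000;
    case "d": return 86_400_000;
    case "w": return 604_800_000;
    default: throw new Error(`unsupported unit: ${unit}`);
  }
}

function resolveSince(range: string, now: Date): Date {
  const rel = /^(\d+)([hdw])$/.exec(range);
  if (rel) {
    const amount = Number(rel[1]);
    return new Date(now.getTime() - amount * unitMs(String(rel[2])));
  }
  const abs = new Date(range); // otherwise treat the value as an ISO timestamp
  if (Number.isNaN(abs.getTime())) throw new Error(`unrecognized range: ${range}`);
  return abs;
}
```

Passing `now` explicitly keeps the function deterministic, which makes it easy to test.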

The per-session aggregate view (--session with no id) and --explain-drift are not yet ported; passing either one exits with status 2 and a directed message.

`burn overhead`

Use burn overhead when you want to know how much standing instruction files cost. Burn discovers CLAUDE.md, .claude/CLAUDE.md, and AGENTS.md, then attributes cached prompt cost to files and headed sections.

Option	What it does
`trim`	Print projected-savings diffs for high-cost headed sections. Burn does not modify files.
`--project <path>`	Project to inspect. Defaults to the current directory.
`--since <range>`	Limit attribution to a time window.
`--kind <k>`	Limit to `claude-md` or `agents-md`.
`--top <n>`	In `trim` mode, recommendations per file.
`--json`	Emit machine-readable attribution for report mode or structured trim recommendations in `trim` mode.

Example	Result
`burn overhead`	Cost per overhead file and section.
`burn overhead --since 30d`	Overhead cost from the last 30 days.
`burn overhead --kind claude-md`	Claude instruction files only.
`burn overhead trim --top 3`	Top three trim recommendations per file.
`burn overhead trim --json`	Structured trim recommendations with projected savings and unified diffs.

Harnesses pay for different files: Claude Code pays for CLAUDE.md; Codex and OpenCode pay for AGENTS.md.
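
The harness-to-file relationship above can be written down as a small lookup. This is a sketch under two assumptions: that `.claude/CLAUDE.md` belongs to the `claude-md` kind alongside `CLAUDE.md`, and that the harness identifiers are `claude`, `codex`, and `opencode` (the first two appear elsewhere in this README, the third is a guess).

```typescript
// Harness identifiers are partly assumed; see the lead-in.
type Harness = "claude" | "codex" | "opencode";
type OverheadKind = "claude-md" | "agents-md";

// File names per kind, as listed in this section (grouping is an assumption).
const FILES_BY_KIND: Record<OverheadKind, string[]> = {
  "claude-md": ["CLAUDE.md", ".claude/CLAUDE.md"],
  "agents-md": ["AGENTS.md"],
};

// Which kind each harness pays for, per the sentence above.
const KIND_BY_HARNESS: Record<Harness, OverheadKind> = {
  claude: "claude-md",
  codex: "agents-md",
  opencode: "agents-md",
};

function overheadFilesFor(harness: Harness): string[] {
  return FILES_BY_KIND[KIND_BY_HARNESS[harness]];
}
```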

`burn compare`

Use burn compare when you want evidence for model choice. It compares models on the work you actually ran, grouped by activity such as coding, debugging, testing, review, exploration, docs, and refactoring.

Option	What it does
`<model_a,model_b[,...]>`	Required comma-separated model list. At least two models.
`--since <range>`	Limit to a relative range or ISO timestamp.
`--project <path>`	Limit to a project.
`--session <id>`	Limit to one session.
`--workflow <id>`	Limit to turns folded with stamp `workflowId=<id>`.
`--agent <id>`	Limit to turns folded with stamp `agentId=<id>`.
`--provider <csv>`	Comma-separated effective providers (case-insensitive).
`--min-sample <n>`	Flag cells below the sample threshold. Default: `5`.
`--fidelity <class>`	Minimum data quality: `full`, `usage-only`, `aggregate-only`, `cost-only`, or `partial`.
`--include-partial`	Include every turn. Shorthand for `--fidelity partial`.
`--json`	Emit a stable JSON object.
`--csv`	Emit one row per model/activity pair.

burn compare reads the ledger as-is — it does NOT run an ingest sweep first. Chain burn ingest && burn compare … (or run burn ingest --watch in the background) when you need the freshest data.

Example	Result
`burn compare claude-sonnet-4-6,claude-haiku-4-5 --since 30d`	Side-by-side activity table.
`burn compare claude-opus-4-7,claude-sonnet-4-6 --project . --json`	Project-scoped JSON comparison.
`burn compare claude-sonnet-4-6,claude-haiku-4-5 --fidelity full`	Compare only full-fidelity turns.
`burn compare claude-sonnet-4-6,claude-haiku-4-5 --include-partial`	Include lower-fidelity records too.

Run burn summary --by-provider to discover model IDs present in your ledger.
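
To make the compared metrics concrete, here is a rough sketch of how cost per turn, one-shot rate, and the `--min-sample` flag could be derived for each model/activity cell. The `TurnSample` shape, including the boolean `oneShot` field, is hypothetical. This README does not show burn's ledger schema or define how it classifies a one-shot turn.

```typescript
// Hypothetical turn shape; burn's real ledger schema is not shown in this README.
interface TurnSample { model: string; activity: string; costUsd: number; oneShot: boolean; }
interface CompareCell { turns: number; costPerTurn: number; oneShotRate: number; lowSample: boolean; }

function compareCells(turns: TurnSample[], minSample = 5): Map<string, CompareCell> {
  // Bucket turns by "model/activity".
  const groups = new Map<string, TurnSample[]>();
  for (const turn of turns) {
    const key = `${turn.model}/${turn.activity}`;
    const bucket = groups.get(key) ?? [];
    bucket.push(turn);
    groups.set(key, bucket);
  }
  // Reduce each bucket to the three reported numbers plus a low-sample flag.
  const cells = new Map<string, CompareCell>();
  groups.forEach((bucket, key) => {
    const totalCost = bucket.reduce((sum, turn) => sum + turn.costUsd, 0);
    const oneShots = bucket.filter((turn) => turn.oneShot).length;
    cells.set(key, {
      turns: bucket.length,
      costPerTurn: totalCost / bucket.length,
      oneShotRate: oneShots / bucket.length,
      lowSample: bucket.length < minSample,
    });
  });
  return cells;
}
```

Cells below the threshold are flagged rather than dropped, matching the description of `--min-sample` as a flag on small cells.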

`burn ingest`

Use burn ingest when sessions already exist, or when another process owns the harness spawn. Default mode scans Claude Code, Codex, and OpenCode stores once.

Option	What it does
`--watch`	Keep polling session stores in the foreground.
`--interval <ms>`	Poll interval in milliseconds. Default: `1000`.
`--quiet`	Suppress normal ingest messages.
`--hook claude`	Read one Claude Code hook payload from stdin and ingest its transcript.

Example	Result
`burn ingest`	Scan all known session stores once.
`burn ingest --watch`	Keep the ingest loop running.
`burn ingest --hook claude --quiet`	Claude Code hook path for orchestrators.

`burn mcp-server`

Use burn mcp-server when an agent should query its own spend mid-session via MCP. The server is stdio-only and read-only.

Option	What it does
`--session-id <uuid>`	Default session ID used by MCP tools when the caller omits one.

Tool	What it returns
`burn__sessionCost`	Total USD, tokens, turns, and models for a session.

Example	Result
`burn mcp-server --session-id <uuid>`	Start a session-scoped stdio MCP server.
`burn mcp-server`	Start a server where tools require explicit session IDs.
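
A launcher that registers the server with an MCP client usually needs to hand over a command and argument list. The sketch below builds that pair and wraps it in the `mcpServers` JSON shape that many MCP clients read. The `burn` server key is an arbitrary choice here, and the file location and exact schema depend on your client, so treat this as an assumption to check against its documentation.

```typescript
interface StdioServerEntry { command: string; args: string[]; }

// Build the argv for `burn mcp-server`, optionally scoped to one session.
function burnMcpEntry(sessionId?: string): StdioServerEntry {
  const args = ["mcp-server"];
  if (sessionId) args.push("--session-id", sessionId);
  return { command: "burn", args };
}

// Wrap the entry in the commonly used `mcpServers` config shape (assumed).
function burnMcpConfig(sessionId?: string): string {
  return JSON.stringify({ mcpServers: { burn: burnMcpEntry(sessionId) } }, null, 2);
}
```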

`burn state`

Use burn state when reports look stale or you want to inspect or rebuild derived storage. Burn stores events and stamps in burn.sqlite, and content/search data in content.sqlite.

Subcommand or option	What it does
`burn state` or `burn state status`	Print status for indexes, content, classifier, and archive.
`--json`	Emit machine-readable status or archive rebuild/vacuum output.
`rebuild index`	Rebuild derivable SQLite read-model data.
`rebuild content`	Re-parse source session files to backfill content and user turns.
`rebuild archive`	Refresh archive metadata in `burn.sqlite`.
`rebuild all [--force]`	Rebuild derivable state.
`prune [--days <n>]`	Delete expired content sidecars. Use `forever` to disable.
`prune --force`	Delete recoverable sidecars even if source session files still exist.
`reset [--force] [--reingest] [--json]`	Wipe derived state. Dry-run without `--force`; preserves config, pricing overrides, and source harness logs.

Example	Result
`burn state`	Derived artifact status.
`burn state status --json`	Machine-readable status.
`burn state rebuild classify --force`	Reclassify every turn with current rules.
`burn state prune --days 30`	Prune content older than 30 days, keeping recoverable sidecars.

Local Data

Path or setting	Purpose
`~/.agentworkforce/burn/burn.sqlite`	Events, stamps, sessions, relationships, and archive metadata.
`~/.agentworkforce/burn/content.sqlite`	Content blobs and the FTS5 search index.
`~/.agentworkforce/burn/config.json`	Content-storage and retention configuration.
`~/.agentworkforce/burn/pending-stamps/`	Temporary manifests used by launchers that do not expose a session ID before spawn.
`RELAYBURN_HOME`	Override the whole Burn data directory.
`RELAYBURN_SQLITE_PATH`	Override the events database path.
`RELAYBURN_CONTENT_PATH`	Override the content database path.
`RELAYBURN_CONTENT_STORE=full|hash-only`	Content storage mode.
`RELAYBURN_CONTENT_TTL_DAYS=<n>`	Sidecar retention. Default: `90`.

Reports read local data from the ledger and derived sidecars.
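
The path settings in the table can be read as a small resolution rule. The sketch below assumes that the per-database overrides win over `RELAYBURN_HOME`, and that the default file names stay the same under a custom home. Neither precedence detail is spelled out in this README, and `resolveBurnPaths` is a hypothetical helper.

```typescript
interface BurnPaths { home: string; sqlite: string; content: string; }

// `env` is a plain map such as process.env; `userHome` is the user's home directory.
function resolveBurnPaths(env: Record<string, string | undefined>, userHome: string): BurnPaths {
  const home = env["RELAYBURN_HOME"] ?? `${userHome}/.agentworkforce/burn`;
  return {
    home,
    // Assumed precedence: a specific database path beats the directory override.
    sqlite: env["RELAYBURN_SQLITE_PATH"] ?? `${home}/burn.sqlite`,
    content: env["RELAYBURN_CONTENT_PATH"] ?? `${home}/content.sqlite`,
  };
}
```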

Packages

Package	Purpose
`relayburn`	npm install wrapper that resolves the prebuilt Rust `burn` binary from `@relayburn/cli-<platform>` optional dependencies.
`@relayburn/sdk`	Node facade over the Rust SDK, resolved through `@relayburn/sdk-<platform>` optional dependencies.
`@relayburn/cli-<platform>`	Prebuilt `burn` binary packages for supported OS/CPU targets.
`@relayburn/sdk-<platform>`	Prebuilt napi-rs packages for supported OS/CPU targets.
`relayburn-sdk`	Rust crate with the embedding API and internal reader/ledger/analyze/ingest modules.
`relayburn-cli`	Rust crate that produces the `burn` binary.

Development

pnpm install
cargo build --workspace
cargo test --workspace
pnpm run test
pnpm run build:napi

The npm workspace contains the Node SDK facade, the relayburn install wrapper, and the platform package manifests used by release automation.

Pricing

Burn ships with a vendored models.dev pricing snapshot. Refresh it with:

pnpm run pricing:update

User overrides live at $RELAYBURN_HOME/models.dev.json and take precedence at lookup time.
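
The precedence rule above, with user overrides first and the vendored snapshot second, can be sketched as a two-step lookup applied when a report is computed. The `Rate` shape (USD per million tokens, input and output only) is a simplification invented for this example. The real models.dev.json layout may differ and likely carries more fields.

```typescript
// Hypothetical price-table shape keyed by model ID; not the real file format.
interface Rate { inputPerMTok: number; outputPerMTok: number; }
type PriceTable = Record<string, Rate>;

// User overrides win; fall back to the vendored snapshot.
function lookupRate(model: string, overrides: PriceTable, vendored: PriceTable): Rate | undefined {
  return overrides[model] ?? vendored[model];
}

// Cost is derived from stored token counts at query time.
function turnCostUsd(rate: Rate, inputTokens: number, outputTokens: number): number {
  return (inputTokens * rate.inputPerMTok + outputTokens * rate.outputPerMTok) / 1_000_000;
}
```

Because cost is computed from token counts at lookup time, changing a rate changes the reported cost without touching stored records.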

What this is for

Agent spend happens in a blind spot. You can see a daily dollar total, and maybe a breakdown by model. You cannot see which tool call, file, subagent, or workflow step drove the cost. Burn makes that question answerable.

The deeper question burn is built around is:

Would the same work cost less with a different model, harness, or tool choice - in dollars or token usage?

You cannot answer that from aggregate spend. It requires attribution at the level of the actual work: this Read cost $0.47 because it added 8,200 tokens to context that rode in every one of the next 23 turns' cache-reads. Once spend is visible at that grain, the choice between Opus and Haiku, between Claude Code and another harness, or between letting an agent re-read a file and passing it a cached summary, becomes a decision you can reason about instead of a guess.
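
The carried-context effect in that example is simple multiplication: tokens added once are re-read from cache on every later turn. The sketch below works through that arithmetic with a made-up cache-read rate. The $0.47 figure in the text depends on rates and token classes this README does not spell out, so the sketch does not try to reproduce it.

```typescript
// Cost of tokens that stay in context and are re-read from cache on later turns.
// The rate is supplied by the caller; this README does not state one.
function carriedContextCostUsd(
  tokensAdded: number,
  laterTurns: number,
  cacheReadUsdPerMTok: number,
): number {
  const cacheReadTokens = tokensAdded * laterTurns;
  return (cacheReadTokens * cacheReadUsdPerMTok) / 1_000_000;
}

// 8,200 tokens carried through 23 turns is 188,600 cache-read tokens.
// 1.5 USD per million tokens is an illustrative rate, not a quoted price.
const illustrativeCarriedCost = carriedContextCostUsd(8_200, 23, 1.5);
```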

Three concrete questions follow:

How much did I spend? - per agent, workflow, session, model, and tool call.
Why was it spent? - tool calls, files, subagents, cache-hit vs fresh input, and persistence.
Where can I save? - redundant reads, bloated context prefixes, retry loops, and model choices that cost more than they needed to.

Burn is local-first. Data lives in SQLite databases on your machine. Burn never phones home. Pricing is looked up at query time from a vendored snapshot, so rate corrections never require rewriting the ledger.

What burn is not

Not a dashboard or a product with a UI of its own.
Not an automatic optimizer. It surfaces the choices; you decide.
Not a leaderboard or a social service.

Composability: how burn plugs into a spawner

Burn is designed to be called by whatever code spawns agent sessions. If you control the spawn, you know things the session log never records on its own: the workflow this session is part of, the persona it is running as, the agent ID, and the tier. Burn accepts that context, attaches it to the session, and makes it queryable later alongside the usage data from the session log.

The primitive is stamping: attach metadata to a session by ID, before or after any turns have been recorded. Launchers that do not know the session ID before spawn should call @relayburn/sdk writePendingStamp() before starting the agent, then run burn ingest / ingest() to fold the tags onto the discovered turns. Direct Rust embedders with an exact session ID can use relayburn_sdk::Stamp and relayburn_sdk::StampSelector against a LedgerHandle.

Stamp selectors:

{ sessionId } - all turns in a session.
{ messageId } - exactly one turn.
{ sessionId, range: { fromTs, toTs } } - a time window, such as a single workflow step within a long-lived session.

Enrichment values are plain strings (Record<string, string>). Burn does not care what keys you use. Typical keys: agentId, parentAgentId, workflowId, stepId, persona, tier, harness, userLabel.
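
The three selector shapes and the enrichment map can be modeled as TypeScript types with a small matching function. This is a sketch only: it assumes timestamps are epoch milliseconds and that range bounds are inclusive, neither of which this README states, and `TurnRef` and `selects` are hypothetical names rather than SDK exports.

```typescript
type Enrichment = Record<string, string>;

type StampSelector =
  | { sessionId: string; range?: { fromTs: number; toTs: number } }
  | { messageId: string };

// Minimal view of a turn for matching purposes (assumed shape).
interface TurnRef { sessionId: string; messageId: string; ts: number; }

function selects(selector: StampSelector, turn: TurnRef): boolean {
  // { messageId } targets exactly one turn.
  if ("messageId" in selector) return selector.messageId === turn.messageId;
  // { sessionId } targets every turn in the session...
  if (selector.sessionId !== turn.sessionId) return false;
  if (!selector.range) return true;
  // ...optionally narrowed to a time window (inclusive bounds assumed).
  return turn.ts >= selector.range.fromTs && turn.ts <= selector.range.toTs;
}

// Example enrichment using keys listed above.
const exampleEnrichment: Enrichment = { workflowId: "wf-1", persona: "code-reviewer" };
```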

This is the composability surface. Burn stays small; the spawner owns the context and decides what to attach.

Spawner-integrated ingest

The recommended launcher integration is the Node SDK pending-stamp primitive:

import { writePendingStamp } from "@relayburn/sdk";

await writePendingStamp({
  harness: "codex",
  cwd: process.cwd(),
  enrichment: {
    persona: "code-reviewer",
    personaTier: "senior",
    agentworkforce: "1",
  },
});

Then spawn the harness normally and let burn ingest or ingest() scan the session stores. Claude launchers can either preallocate --session-id and write an exact session stamp from Rust, or use writePendingStamp({ harness: "claude", ... }) when the final session ID is not available before spawn.

Codex and OpenCode do not expose a pre-spawn session ID. writePendingStamp() writes a pending-stamp manifest under $RELAYBURN_HOME/pending-stamps/ before the launcher spawns the agent. Ingest resolves the manifest against the first matching session file before the first turn is appended. Abandoned pending manifests are cleaned up after 24 hours.

For passive ingest, run:

burn ingest
burn ingest --watch [--interval <ms>]

burn ingest scans Claude, Codex, and OpenCode stores once and uses the same cursor and dedup path as the reporting commands. burn ingest --watch keeps that scan loop running in the foreground.

Hook-based ingest for orchestrators

If your code already controls the Claude Code spawn, you can install burn's hooks per invocation via Claude's --settings flag without mutating global ~/.claude/settings.json. Wire the hook command to:

burn ingest --hook claude --quiet

Hook payloads land on stdin and get forwarded to burn ingest. The command is safe to re-fire on every hook; the ledger cursor and dedup path keep ingestion idempotent, so the hook path and normal session-store path reconcile against the same session.
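
As a rough illustration, an orchestrator could generate the per-invocation settings object like this. The nested `hooks` layout follows Claude Code's hook configuration format as commonly documented. The choice of the `Stop` event is an assumption, since this README does not say which hook events burn expects, and `burnHookSettings` is a hypothetical helper.

```typescript
interface HookCommand { type: "command"; command: string; }
interface HookMatcher { hooks: HookCommand[]; }
interface ClaudeHookSettings { hooks: Record<string, HookMatcher[]>; }

// Build a settings object that wires the given hook events to burn's ingest hook.
// Which events to use is an assumption; adjust to your orchestrator's needs.
function burnHookSettings(events: string[] = ["Stop"]): ClaudeHookSettings {
  const command = "burn ingest --hook claude --quiet";
  const hooks: Record<string, HookMatcher[]> = {};
  for (const event of events) {
    hooks[event] = [{ hooks: [{ type: "command", command }] }];
  }
  return { hooks };
}

// Serialize for a temporary settings file or inline JSON passed via --settings.
const burnHookSettingsJson = JSON.stringify(burnHookSettings(), null, 2);
```

Generating the settings per spawn keeps the global ~/.claude/settings.json untouched, which is the point of the `--settings` approach described above.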

