ralph-town

Disposable Daytona sandboxes for LLM evals, CLI smoke tests, and isolated command execution.

Ralph-Town is a CLI, so any coding harness or LLM tool runner that can execute shell commands can use it inline. MCP clients can also use the companion MCP server directly.

Why?

LLM tools and eval harnesses often need to run commands against real projects without touching your local machine. A disposable Daytona sandbox gives each run a clean environment, controlled credentials, and structured output that another tool or model can consume.

Instead of asking an agent to run a risky install, generated command, or smoke test on your laptop, ask it to prefix the command with `ralph-town run --` and inspect the result.

Quick start

# Run any command in a fresh sandbox, then delete it
ralph-town run -- node --version

# Smoke-test a CLI without installing it locally
ralph-town run -- pnpx cowsay@latest "hello from a sandbox"

# Run against a clean repository checkout
ralph-town run \
  --repo https://github.com/user/project \
  -- pnpm test

# Preserve the sandbox for debugging
ralph-town run --keep -- sh -lc 'node --version && npm --version'

# Use structured output for eval harnesses
ralph-town run --json -- sh -lc 'printf "ok\\n"; exit 0'

The `run` command creates a sandbox, executes the command through Daytona's process API, captures stdout, stderr, and the exit code, and then deletes the sandbox unless `--keep` is set.
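The lifecycle described above can be modelled in a few lines. This is a sketch of the documented behaviour, not Ralph-Town's actual implementation: `run_once`, `RunOutcome`, and the three callables (`create`, `execute`, `delete`) are hypothetical stand-ins for the Daytona calls, and the `kept`, `deleted`, and `cleanup_error` fields simply mirror the names in the JSON result shape.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence, Tuple


@dataclass
class RunOutcome:
    exit_code: int
    stdout: str
    stderr: str
    kept: bool
    deleted: bool
    cleanup_error: Optional[str]


def run_once(
    create: Callable[[], str],
    execute: Callable[[str, Sequence[str]], Tuple[int, str, str]],
    delete: Callable[[str], None],
    command: Sequence[str],
    keep: bool = False,
) -> RunOutcome:
    """Create a sandbox, run one command, then clean up unless keep is set."""
    sandbox_id = create()
    deleted = False
    cleanup_error: Optional[str] = None
    try:
        exit_code, stdout, stderr = execute(sandbox_id, command)
    finally:
        # Cleanup runs even if execution raised, so sandboxes do not leak.
        if not keep:
            try:
                delete(sandbox_id)
                deleted = True
            except Exception as exc:  # record the failure instead of masking the result
                cleanup_error = str(exc)
    return RunOutcome(exit_code, stdout, stderr, keep, deleted, cleanup_error)


if __name__ == "__main__":
    # In-memory fakes stand in for the real sandbox service.
    outcome = run_once(
        create=lambda: "abc123",
        execute=lambda sandbox_id, cmd: (0, "ok\n", ""),
        delete=lambda sandbox_id: None,
        command=["node", "--version"],
    )
    print(outcome)
```

Recording a cleanup failure separately from the command result means a failed delete does not hide whether the command itself succeeded.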

Typical LLM session usage

If your coding assistant has a shell tool, it can run Ralph-Town inline without any special integration:

# Check whether a generated command works before trying it locally
ralph-town run -- pnpx some-cli@latest --help

# Try a repo test suite in a disposable clone
ralph-town run \
  --repo https://github.com/user/project \
  -- sh -lc 'pnpm install --frozen-lockfile && pnpm test'

# Capture JSON for automated grading or eval analysis
ralph-town run --json -- python - <<'PY'
print('hello from an isolated Daytona sandbox')
PY
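A harness written in Python could wrap the same invocations. The sketch below assembles the argument list using only the flags shown above (`--repo`, `--keep`, `--json`, and the `--` separator). The helper names `build_run_argv` and `run_in_sandbox` are hypothetical, and the sketch assumes that the flags can be combined and that `ralph-town` is on `PATH`.

```python
import subprocess
from typing import List, Optional, Sequence


def build_run_argv(
    command: Sequence[str],
    repo: Optional[str] = None,
    keep: bool = False,
    json_output: bool = False,
) -> List[str]:
    """Assemble a `ralph-town run` argument list (hypothetical helper)."""
    if not command:
        raise ValueError("command must not be empty")
    argv = ["ralph-town", "run"]
    if repo is not None:
        argv += ["--repo", repo]
    if keep:
        argv.append("--keep")
    if json_output:
        argv.append("--json")
    # Everything after `--` is the command to run inside the sandbox.
    return argv + ["--", *command]


def run_in_sandbox(command: Sequence[str], **options) -> subprocess.CompletedProcess:
    """Invoke the CLI; assumes `ralph-town` is installed and on PATH."""
    return subprocess.run(
        build_run_argv(command, **options),
        capture_output=True,
        text=True,
        check=False,  # a non-zero exit is data for the harness, not an exception
    )


if __name__ == "__main__":
    print(build_run_argv(["pnpm", "test"], repo="https://github.com/user/project"))
```

Passing the command as a list rather than a single string avoids a second round of shell quoting on the local machine.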

For richer tool integration, expose `mcp-ralph-town` to an MCP-capable client and call `sandbox_run`, `sandbox_create`, `sandbox_exec`, and the other sandbox tools directly.

Example: my-pi

my-pi is my personal coding agent harness. Ralph-Town does not require it, but it is a useful example of the kind of CLI harness you can smoke-test inside a disposable sandbox:

ralph-town run -- pnpx my-pi@latest --help

Install

npm install -g ralph-town
# or
npx ralph-town --help

Commands

# One-shot command execution
ralph-town run -- <command>

# Create a reusable sandbox
ralph-town sandbox create [--name NAME]

# Get SSH credentials
ralph-town sandbox ssh <id>

# List active sandboxes
ralph-town sandbox list

# Execute command in an existing sandbox
ralph-town sandbox exec <id> <command>

# Check sandbox health
ralph-town sandbox health <id> [--ping]

# Delete sandbox
ralph-town sandbox delete <id>

JSON result shape

ralph-town run --json -- sh -lc 'printf "ok\\n"'

{
	"sandbox_id": "abc123",
	"command": "'sh' '-lc' 'printf \"ok\\\\n\"'",
	"repo": null,
	"branch": null,
	"cwd": null,
	"exit_code": 0,
	"stdout": "...",
	"stderr": "",
	"timed_out": false,
	"duration_ms": 2312,
	"kept": false,
	"deleted": true,
	"cleanup_error": null
}
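An eval harness might turn this result into a pass/fail verdict. The sketch below uses only the field names shown above. The `grade` function and `Verdict` type are hypothetical, and the pass criteria (exit code 0, no timeout, cleanup errors treated as non-fatal) are assumptions rather than anything Ralph-Town prescribes.

```python
import json
from dataclasses import dataclass


@dataclass
class Verdict:
    passed: bool
    reason: str


def grade(raw: str) -> Verdict:
    """Grade one `ralph-town run --json` result (assumed pass criteria)."""
    result = json.loads(raw)
    # Check the timeout first: the exit code may not be meaningful in that case.
    if result.get("timed_out"):
        return Verdict(False, "timed out")
    exit_code = result.get("exit_code")
    if exit_code != 0:
        return Verdict(False, f"exit code {exit_code}")
    if result.get("cleanup_error"):
        # The command succeeded; a leftover sandbox is worth flagging but not failing.
        return Verdict(True, "passed, but sandbox cleanup reported an error")
    return Verdict(True, "passed")


if __name__ == "__main__":
    sample = json.dumps(
        {
            "sandbox_id": "abc123",
            "exit_code": 0,
            "stdout": "ok\n",
            "stderr": "",
            "timed_out": False,
            "kept": False,
            "deleted": True,
            "cleanup_error": None,
        }
    )
    print(grade(sample))
```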

Environment variables

Variable	Context	Description
`DAYTONA_API_KEY`	local orchestrator	Daytona API key from daytona.io
`GH_TOKEN`	local orchestrator	Optional GitHub token for commands run locally
`ANTHROPIC_API_KEY`	local orchestrator	Optional Anthropic key for commands run locally
`SANDBOX_GH_TOKEN`	sandbox	Optional GitHub token forwarded into sandboxes as `GH_TOKEN`
`SANDBOX_ANTHROPIC_API_KEY`	sandbox	Optional Anthropic key forwarded into sandboxes as `ANTHROPIC_API_KEY`

`GITHUB_PAT` is still accepted as a deprecated compatibility alias for `SANDBOX_GH_TOKEN`.
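The forwarding rules in the table can be sketched as a small mapping function. This illustrates the documented renaming only. The `sandbox_env` helper is hypothetical, and the precedence shown (`SANDBOX_GH_TOKEN` wins over the deprecated `GITHUB_PAT` when both are set) is an assumption, not confirmed behaviour.

```python
from typing import Dict, Mapping


def sandbox_env(local_env: Mapping[str, str]) -> Dict[str, str]:
    """Return the variables a sandbox would receive (hypothetical helper)."""
    forwarded: Dict[str, str] = {}

    # Assumed precedence: the current name first, then the deprecated alias.
    gh_token = local_env.get("SANDBOX_GH_TOKEN") or local_env.get("GITHUB_PAT")
    if gh_token:
        forwarded["GH_TOKEN"] = gh_token

    anthropic_key = local_env.get("SANDBOX_ANTHROPIC_API_KEY")
    if anthropic_key:
        forwarded["ANTHROPIC_API_KEY"] = anthropic_key

    # Local-only variables such as DAYTONA_API_KEY, GH_TOKEN, and
    # ANTHROPIC_API_KEY are deliberately not forwarded.
    return forwarded


if __name__ == "__main__":
    print(sandbox_env({"GH_TOKEN": "local-only", "SANDBOX_GH_TOKEN": "for-sandbox"}))
```

Keeping the local and sandbox variables separate lets the orchestrator hold broader credentials than the code running inside the sandbox ever sees.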

Packages

Package	Description
`packages/cli`	Main CLI tool
`packages/mcp-ralph-town`	MCP server for sandbox orchestration

Development

pnpm dev
pnpm run check
pnpm run test
pnpm run build

Research

See `docs/RESEARCH.md` for Daytona SDK notes and implementation findings.
