punch

Getting Started with Punch

This guide walks you through installing Punch, spawning your first fighter, running an autonomous gorilla, and setting up multi-agent coordination — all in under 10 minutes.

Prerequisites

Rust 2024 edition (1.85+) if building from source
An LLM provider — either a local Ollama instance or an API key for Anthropic/OpenAI/etc.

Step 1: Install

Choose your method:

# Via Cargo (recommended)
cargo install punch-cli

# Via Homebrew
brew tap humancto/tap && brew install punch

# From source
git clone https://github.com/humancto/punch && cd punch && cargo build --release

Verify:

punch --version

Step 2: Initialize

punch init

This creates ~/.punch/ with a default configuration file. Edit ~/.punch/config.toml to set your LLM provider:

Option A: Local Ollama (free, private)

api_listen = "127.0.0.1:6660"

[default_model]
provider = "ollama"
model = "qwen3:8b"
base_url = "http://localhost:11434"
max_tokens = 4096
temperature = 0.7

Make sure Ollama is running: ollama serve

Option B: Anthropic Claude

api_listen = "127.0.0.1:6660"

[default_model]
provider = "anthropic"
model = "claude-sonnet-4-20250514"
api_key_env = "ANTHROPIC_API_KEY"

Then export your key: export ANTHROPIC_API_KEY=sk-ant-...

Option C: OpenAI

api_listen = "127.0.0.1:6660"

[default_model]
provider = "openai"
model = "gpt-4o-mini"
api_key_env = "OPENAI_API_KEY"

Option D: Google Gemini (fast + free tier)

api_listen = "127.0.0.1:6660"

[default_model]
provider = "google"
model = "gemini-2.0-flash"
api_key_env = "GOOGLE_API_KEY"

Get a key at aistudio.google.com/apikey.

Option E: Groq (extremely fast, free tier)

api_listen = "127.0.0.1:6660"

[default_model]
provider = "groq"
model = "llama-3.3-70b-versatile"
api_key_env = "GROQ_API_KEY"

Tip: Store API keys in ~/.punch/.env — the daemon loads them automatically on startup. Example: echo 'GOOGLE_API_KEY=your-key' >> ~/.punch/.env

Model tip: For reliable tool use (calendar, email, file access), use gpt-4.1-mini or better. Smaller models like gpt-4.1-nano or gemini-2.0-flash-lite may ignore tools entirely.

Optional: Smart Model Routing

Instead of sending everything to one model, route by complexity. Simple questions go cheap, complex reasoning goes premium. Add this below your [default_model]:

Single provider (e.g. all Gemini):

[model_routing]
enabled = true

[model_routing.cheap]
provider = "google"
model = "gemini-2.0-flash-lite"
api_key_env = "GOOGLE_API_KEY"

[model_routing.mid]
provider = "google"
model = "gemini-2.5-flash"
api_key_env = "GOOGLE_API_KEY"

[model_routing.expensive]
provider = "google"
model = "gemini-2.5-pro"
api_key_env = "GOOGLE_API_KEY"

Mix providers (use each provider’s strengths):

[model_routing]
enabled = true

[model_routing.cheap]
provider = "groq"
model = "llama-3.3-70b-versatile"
api_key_env = "GROQ_API_KEY"

[model_routing.mid]
provider = "openai"
model = "gpt-4.1-mini"
api_key_env = "OPENAI_API_KEY"

[model_routing.expensive]
provider = "anthropic"
model = "claude-sonnet-4-20250514"
api_key_env = "ANTHROPIC_API_KEY"

Each tier can use a different provider — just add the relevant API keys to ~/.punch/.env. When routing is disabled (or a tier isn’t configured), the [default_model] is used as fallback.

Step 3: Start the Daemon

punch start

Punch auto-spawns a default fighter (“Punch”) with full tool access. You’re ready to chat immediately.

macOS Permissions (Required for Desktop Automation)

Punch can take screenshots, control apps, read messages, and automate your desktop. On macOS, these features require explicit permission grants. Open System Settings > Privacy & Security and enable your terminal app (Terminal, iTerm2, Warp, etc.) for:

Permission	What it enables
Accessibility	UI automation, reading app elements, AppleScript via System Events
Screen Recording	Taking screenshots of your screen and app windows
Full Disk Access	Reading app databases (iMessage history, Safari data, etc.)

Restart your terminal after granting permissions. The Automation permission is auto-prompted the first time Punch uses AppleScript to control a specific app.

Without these permissions, desktop automation tools will fail and Punch will fall back to shell-based alternatives where possible.

Chat via CLI

punch chat "What are the key differences between Rust and Go for systems programming?"

Or via the API

# List fighters
curl http://localhost:6660/api/fighters

# Send a message
curl -X POST http://localhost:6660/api/fighters/{fighter_id}/message \
  -H "Content-Type: application/json" \
  -d '{"message": "Explain the actor model in distributed systems"}'

Connect a messaging channel (optional)

Deploy your fighter to Telegram, Slack, or Discord — the wizard handles bot creation, security, tunnel setup, and webhook registration:

punch channel setup telegram

See Channels Guide for the full documentation.

Step 4: Spawn Specialist Fighters (Optional)

The default fighter handles most tasks. For specialized work, spawn additional fighters:

punch fighter spawn coder       # Full-stack engineer with shell access
punch fighter spawn scout       # Deep research agent
punch fighter spawn oracle      # Conversational AI with broad knowledge

All fighters have access to MCP tools (calendar, email, etc.) configured in your config.toml.

Step 5: Create a Custom Fighter

Fighters are defined by their manifest — a JSON object that controls personality, model, and capabilities:

curl -X POST http://localhost:6660/api/fighters \
  -H "Content-Type: application/json" \
  -d '{
    "manifest": {
      "name": "Atlas",
      "description": "Senior architect who thinks in systems",
      "system_prompt": "You are Atlas, a senior systems architect. You think about distributed systems, scalability, and trade-offs. You always consider failure modes. You draw from real-world experience at companies that operate at scale.",
      "model": {
        "provider": "ollama",
        "model": "qwen3:8b",
        "base_url": "http://localhost:11434",
        "max_tokens": 4096,
        "temperature": 0.7
      },
      "weight_class": "heavyweight",
      "capabilities": [{"type": "memory"}]
    }
  }'

The fighter now has a persistent identity. Every conversation strengthens its Creed — the living document that defines who it is.

Step 6: Unleash a Gorilla

Gorillas are autonomous background agents that run on schedules without human interaction.

# List available gorillas
punch gorilla list

# Unleash the Alpha researcher (runs every 6 hours)
punch gorilla unleash alpha

# Check its status
punch gorilla status alpha

# Stop it
punch gorilla cage alpha

Bundled gorillas

Gorilla	Schedule	What it does
Alpha	Every 6h	Deep research with cross-referencing and fact-checking
Ghost	Every 30m	OSINT monitoring, change detection, anomaly analysis
Prophet	Daily	Probabilistic forecasting with Brier score calibration
Scout Troop	Every 4h	Lead generation with ICP-based scoring
Swarm	Every 3h	Multi-platform social media content creation
Brawler	Every 2h	Web automation, form filling, data extraction
Howler	Every 2 days	Short-form video script creation

Step 7: Fighter-to-Fighter Communication

Fighters can talk to each other — and they remember who they’ve spoken to:

# Spawn two fighters
curl -X POST http://localhost:6660/api/fighters \
  -H "Content-Type: application/json" \
  -d '{"manifest": {"name": "Optimist", "description": "Sees opportunity everywhere", "system_prompt": "You are optimistic about technology and its potential to solve problems."}}'

curl -X POST http://localhost:6660/api/fighters \
  -H "Content-Type: application/json" \
  -d '{"manifest": {"name": "Skeptic", "description": "Questions everything", "system_prompt": "You are deeply skeptical about technology hype. You demand evidence and question assumptions."}}'

# Have them debate
curl -X POST http://localhost:6660/api/fighters/{optimist_id}/message-to/{skeptic_id} \
  -H "Content-Type: application/json" \
  -d '{"content": "AI agents will replace 50% of knowledge work within 3 years. Change my mind."}'

Step 8: Form a Troop

Troops coordinate multiple fighters with different strategies:

# Create a troop with the Pipeline strategy
curl -X POST http://localhost:6660/api/troops \
  -H "Content-Type: application/json" \
  -d '{
    "name": "Review Pipeline",
    "leader": "{architect_id}",
    "members": ["{coder_id}", "{reviewer_id}", "{tester_id}"],
    "strategy": "pipeline"
  }'

# Assign a task — it flows through each member sequentially
curl -X POST http://localhost:6660/api/troops/{troop_id}/tasks \
  -H "Content-Type: application/json" \
  -d '{"task": "Write a rate limiter in Rust, review it for security, then write tests"}'

Available strategies

Strategy	How it works
Pipeline	Output of agent N becomes input to agent N+1
Broadcast	All agents receive the same task, results aggregated
Consensus	All agents vote, majority wins
LeaderWorker	Leader decomposes task, workers execute
RoundRobin	Tasks distributed evenly in rotation
Specialist	Routed to best-matching agent by capability

Step 9: Install Skills from the Marketplace

Skills (called “Moves” in Punch) add domain expertise to your fighters:

# Search for skills
punch move search "security"

# Install one
punch move install security-auditor

# List what's installed
punch move list

# Security scan a skill before installing
punch move scan security-auditor

Punch ships with 103 bundled skills covering programming languages, frameworks, cloud platforms, business operations, and more.

Step 10: Add More Channels

Already set up Telegram in Step 3? Add more platforms — they share the same tunnel:

punch channel setup slack
punch channel setup discord

Manage your channels:

punch channel list                  # See all channels
punch channel tunnel                # Show tunnel URL
punch channel remove slack          # Remove a channel

See Channels Guide for the full documentation.

What to Explore Next

Creeds — Build persistent agent identities: GET /api/creeds/{name}/render
Workflows — Define multi-step automation: POST /api/workflows
Triggers — Fire actions on events: POST /api/triggers
Budgets — Set spending limits per fighter: PUT /api/budget/fighters/{id}
A2A Protocol — Delegate to remote agents: POST /a2a/tasks/send
WASM Plugins — Extend with WebAssembly: capability PluginInvoke
P2P Federation — Connect Punch instances: punch-wire protocol
Dashboard — Monitor everything live: http://localhost:6660/dashboard

Configuration Reference

See punch.toml.example for the full configuration with all options documented.

Architecture

See architecture.md for the internal architecture deep-dive.

Security

See security.md for the 18-layer security model.

Troubleshooting

Port 6660 already in use Kill the existing process and restart:

kill $(lsof -t -i :6660)
punch start

Bot not responding on Telegram

Check the daemon is running: punch status
Check cloudflared tunnel is running: punch channel tunnel
Verify the webhook URL matches your tunnel URL: punch channel status telegram

Bot says “I can’t” instead of using tools Switch to a larger model. Nano/lite models (gpt-4.1-nano, gemini-2.0-flash-lite) don’t reliably call tools. Use gpt-4.1-mini, claude-haiku, or gpt-4.1 instead.

Telegram allowlist 403 Use your numeric user ID, not your @username. Send /start to @userinfobot on Telegram to get your numeric ID.

cloudflared not installed

# macOS
brew install cloudflare/cloudflare/cloudflared

# Linux
curl -L https://pkg.cloudflare.com/cloudflared-stable-linux-amd64.deb -o cloudflared.deb && sudo dpkg -i cloudflared.deb

Quick tunnel URL changed Re-run punch channel setup telegram to re-register the webhook. For a permanent URL, use a named tunnel (punch channel tunnel --mode named).

This site is open source. Improve this page.