decomp-agents

Parallel autonomous Claude agents that grind the decomp.

Python

Tooling

AGPL-3.0-or-later

macOS

Linux

decomp-agents

Parallel autonomous Claude agents that grind through meteor-decomp's per-function matching workflow. Spawns 1–8 worker subprocesses, each pinned to its own git worktree of meteor-decomp/, claims functions from a shared SQLite queue, and merges completed branches back to the base branch with auto-resolution for trivial YAML row conflicts.

What it does

Provisions N git worktrees of meteor-decomp (.worktrees/agent-0, agent-1, …) — shared .git/, no full re-clone.
Spawns N worker subprocesses, each running the Claude Agent SDK in acceptEdits permission mode, pointed at its own worktree.
Hands each worker one function at a time via a SQLite-backed atomic claim. SQLite is the source of truth because meteor-decomp/config/*.yaml is gitignored (regenerated from make split) — git commits can't be the claim mechanism.
Each worker drives the canonical loop (claim → asm → cpp → make .obj → make diff FUNC=… → iterate → commit or bail). See meteor-decomp/docs/matching-workflow.md and meteor-decomp/AGENTS.md.
An orchestrator-side merge loop periodically pulls completed per-function branches back into the base branch, auto-resolving YAML-only conflicts and escalating anything else to output/merges/escalated/<claim>.json for human review.

Why meteor-decomp specifically

The workflow has the rare combination of properties that make autonomous agents safe to run in parallel:

Atomic unit of work: one function per YAML row, ~12k unmatched in ffxivgame.exe alone.
Programmatic ground truth: make diff FUNC=... returns OK / PARTIAL / MISMATCH — no human judgment needed to grade the work.
Low cross-function coupling: functions compile to independent .obj files; the linker handles .text$X<rva> subsection merging.
Reproducible build: frozen MSVC 2005 SP1 + Wine.
Cheap recovery: a bad branch is just a git branch -D away.

Install

Requires Python ≥ 3.11. The project uses uv or pip — either works.

cd server-workspace/decomp-agents
uv venv && source .venv/bin/activate
uv pip install -e .                 # or:  pip install -e .
cp .env.example .env

Authentication

Two paths, pick one. The SDK auto-detects.

A) Use your Claude Pro / Max subscription (no per-token billing). One-time OAuth via the Claude Code CLI; credentials land in macOS Keychain and every worker subprocess picks them up automatically.

claude              # opens browser; sign in once
claude /status      # confirm "Logged in via subscription"
unset ANTHROPIC_API_KEY    # make sure no key shadows OAuth
# optionally pin intent so a stray key in some other shell can't sneak in:
echo "DECOMP_AUTH_MODE=subscription" >> .env

Quota draws against your plan's monthly Agent-SDK credit pool (Max 5× $100/mo, Max 20× $200/mo, Pro $20/mo as of 2026-06-15). Track at your Claude account /usage page. When exhausted you'll see HTTP 429 / quota_exceeded and the worker will release the claim with outcome=error.

B) Use a metered API key (pay-as-you-go). Get a key at https://console.anthropic.com/settings/keys, add a payment method, then:

echo "ANTHROPIC_API_KEY=sk-ant-..." >> .env
# optionally pin intent:
echo "DECOMP_AUTH_MODE=api_key" >> .env

Budget ballpark: ~$0.10–0.50 per attempted function. Five workers × eight attempts ≈ $4–20 per session.

You also need:

a working meteor-decomp checkout at ../meteor-decomp/ (override with DECOMP_REPO=) that has already run make split BINARY=<X>.exe for the binary you want to chew on (so config/<X>.yaml exists)
the meteor-decomp build toolchain working (tools/cl-wine.sh, Wine, MSVC 2005 SP1) — see meteor-decomp/docs/msvc-setup.md
Recommended: make decompile-headless BINARY=<X>.exe in meteor-decomp before running this. That dumps Ghidra's pseudo-C to build/ghidra-decomp/<binary>/<rva>_<sym>.c so workers have a structural hint per function. Without it they work from raw asm only (still possible but match rate drops).

Run

Dry run first — provisions worktrees + DB, no agents spawned:

python orchestrator.py --dry-run

Then for real:

python orchestrator.py

Or via the installed script:

decomp-agents

Environment variables

Name	Default	What it does
`ANTHROPIC_API_KEY`	unset	Pay-as-you-go API key. Leave unset to use subscription OAuth (via `claude login`).
`DECOMP_AUTH_MODE`	autodetect	Pin auth path: `subscription` or `api_key`. Auto-detect = `api_key` if `ANTHROPIC_API_KEY` set, else `subscription`.
`DECOMP_AGENT_WORKERS`	`5`	Parallel worker count, 1–8.
`DECOMP_REPO`	`../meteor-decomp`	Path to the meteor-decomp checkout.
`DECOMP_BINARY`	`ffxivlogin`	One of `ffxivgame`, `ffxivboot`, `ffxivlogin`, `ffxivconfig`, `ffxivupdater`.
`DECOMP_WORKTREE_ROOT`	`<repo>/.worktrees`	Where the per-agent worktrees live.
`DECOMP_OUTPUT_DIR`	`./output`	Unified output directory (transcripts, logs, claims, merges, SQLite DB).
`DECOMP_MAX_ATTEMPTS_PER_WORKER`	`8`	Hard cap per session per worker.
`DECOMP_MAX_ITERATIONS_PER_FUNCTION`	`12`	Approximate turn budget. Worker bails past this.
`DECOMP_MAX_FUNCTION_SIZE`	`0x200`	Skip functions bigger than this; small ones match fastest.
`DECOMP_LIVE_TAIL`	`1`	Pretty-print worker JSON events on stderr.
`DECOMP_AUTOPUSH`	`branches+master`	`none` / `branches` / `branches+master`. Whether (and how aggressively) to `git push origin` after successful matches.
`DECOMP_WORKER_MODEL`	`opus`	`opus` / `sonnet` / `haiku` or a full `claude-...` model id. Opus is the strongest at codegen reasoning.
`DECOMP_POST_MERGE_STAMP`	`1`	After each successful match-branch merge, run meteor-decomp's `stamp_clusters.py --reloc` + `validate_clusters.py` + `update_yaml_status.py` against the matched RVA's cluster (read from `build/easy_wins/<bin>.clusters_reloc.json`). Stamped + GREEN siblings are committed on top of the merge before the base ref is advanced. Singleton matches are a silent no-op. Requires that `make stamp-reloc` has been run at least once so the cluster JSON exists. Set to `0` to disable.
`DECOMP_CROSS_SESSION_MERGE`	`1`	Drain pending merges from prior sessions in addition to the current one. Prior orchestrators sometimes exited before their final merge pass landed every match; this picks up the strays. Set to `0` to restrict the merge loop to the current session's claims only.
`DECOMP_MODE`	`local`	`local` (everything above) or `distributed` (fork-based, GitHub-native — see Distributed mode). The vars below are read only when `distributed`.
`DECOMP_UPSTREAM`	unset	Distributed: required. The canonical upstream repo (`owner/repo`, e.g. `owner/meteor-decomp`) the agent contributes PRs to.
`DECOMP_FORK`	unset	Distributed: the agent's fork (`owner/repo`). Unset → the agent creates/derives one under its own `gh` identity via `gh repo fork`.
`DECOMP_CLAIM_ISSUE`	unset	Distributed: required. Upstream coordination issue number whose comments fire `claim.yml`. The agent posts `/claim FUN_<va> <binary>` there.
`DECOMP_UPSTREAM_BRANCH`	`develop`	Distributed: branch PRs target and the fork tracks for the solved set.
`DECOMP_FORK_ROOT`	`./output/fork`	Distributed: where the fork working clone is checked out.
`DECOMP_ARTIFACTS_DIR`	`DECOMP_REPO` if it has an `orig/` dir, else unset	Distributed: a local meteor-decomp checkout that holds your own `orig/<bin>.exe` + `config/<bin>.symbols.json` (the copyright-derived files `make diff` needs). Distributed mode symlinks them into the fork clone; they are never copied or committed. Unset + no `orig/` under `DECOMP_REPO` ⇒ grading is disabled with a warning. See Prerequisites — bring your own SE binary.
`DECOMP_CLAIM_POLL_TIMEOUT_S`	`180`	Distributed: total seconds to poll the upstream `claims` branch for a claim win after posting `/claim`.
`DECOMP_CLAIM_POLL_INTERVAL_S`	`6`	Distributed: per-poll sleep seconds.

Git topology — what actually moves to GitHub

After a worker matches a function:

Worker commits to its per-function branch agents/agent-N/<binary>/<rva>_<symbol>.
Worker pushes that branch to origin (if DECOMP_AUTOPUSH ∈ {branches, branches+master}).
Every 20s, the merge orchestrator (in a dedicated _merge/staging worktree to avoid colliding with whatever branch your main checkout is on) merges the per-function branch into the local base branch via git update-ref. YAML-only conflicts auto-resolve; everything else escalates to output/merges/escalated/.
The merge orchestrator pushes the advanced base branch to origin (if DECOMP_AUTOPUSH == branches+master).
Your main meteor-decomp checkout sees the new base SHA the next time it reads the ref. No worktree gets clobbered.

If DECOMP_AUTOPUSH=none: steps 1, 3 happen; 2 and 4 are skipped. You push manually when you're ready.

If DECOMP_POST_MERGE_STAMP=1 (the default), step 3 also chains a stamp: commit on the staging branch before the base ref advances — so each match landing brings its cluster's GREEN siblings with it. The staging worktree is the only place stamping runs; agent worktrees stay tight to one .cpp + one YAML row per branch.

The orchestrator runs an ls-remote sanity check at startup; if SSH auth is broken you'll see a warning before workers spawn rather than a flood of push failures mid-run.

Distributed mode (issue #11)

Everything above describes local mode (the default): N worktrees of a single meteor-decomp checkout, a shared local SQLite claim queue, and a local merge loop. Local mode is unchanged and remains the default — if you don't set DECOMP_MODE, you get exactly the topology above.

Distributed mode (DECOMP_MODE=distributed) instead runs one agent against a fork of meteor-decomp, coordinating with other contributors (human and automated) through meteor-decomp's GitHub-native claim system rather than a local SQLite queue. It reuses the exact same per-function match loop and GREEN grader — only the claim authority and the ship-the-work step differ.

Prerequisites

gh authenticated as the agent's own identity. Run gh auth login first. Claims are attributed to that login — there is no shared bot token, and every agent must authenticate as itself. (This is enforced upstream: claim.yml binds the claim owner to the authenticated comment author.)
A coordination issue open on the upstream repo whose comments fire claim.yml, and the upstream claim workflows live (see meteor-decomp's docs/claim-protocol.md § Operating the claim system). Put its number in DECOMP_CLAIM_ISSUE.
The meteor-decomp build toolchain (Wine + MSVC 2005) working, same as local mode — the committed tools/cl-wine.sh wrapper points at your global MSVC-2005-under-Wine install, so the toolchain itself is not provisioned per clone.
Bring your own SE binary (copyright). The PR-gate's GREEN check runs make diff, which needs two files that are gitignored in every meteor-decomp clone (so a fresh fork clone never has them): orig/<bin>.exe (the Square Enix game binary) and config/<bin>.symbols.json (the Ghidra dump derived from it). These are copyrighted SE-derived material and are never distributed, copied into git, or committed — distributed mode cannot ship them. You must have your own local copy: a meteor-decomp checkout with orig/<bin>.exe present, plus config/<bin>.symbols.json (run make split BINARY=<bin>.exe there once if it's missing). Point DECOMP_ARTIFACTS_DIR at that checkout; the agent symlinks the two files into the fork clone after cloning. Because the clone's .gitignore already ignores orig/* and config/*.symbols.json, those symlinks can never be staged or land in your PR diff. DECOMP_ARTIFACTS_DIR defaults to DECOMP_REPO when unset (the common case where your local-mode checkout already holds the artifacts); if neither is available the run still claims + matches but grading is disabled with a clear warning until you set it.

Configure

echo "DECOMP_MODE=distributed"          >> .env
echo "DECOMP_UPSTREAM=owner/meteor-decomp" >> .env
echo "DECOMP_CLAIM_ISSUE=123"           >> .env     # the coordination issue
# Optional:
# echo "DECOMP_FORK=your-login/meteor-decomp" >> .env  # else gh repo fork derives one
# echo "DECOMP_UPSTREAM_BRANCH=develop"       >> .env  # PR target (default develop)
# echo "DECOMP_FORK_ROOT=./output/fork"       >> .env  # fork working clone
# echo "DECOMP_ARTIFACTS_DIR=../meteor-decomp" >> .env # your own orig/ + symbols.json
#                                                      # (defaults to DECOMP_REPO; never shipped)

Flow: fork → claim → match → PR

A distributed agent (DistributedAgent in src/decomp_agents/distributed_orchestrator.py) does, for one identity:

Provision a fork. gh repo fork (or reuse DECOMP_FORK), clone it to DECOMP_FORK_ROOT, add an upstream remote pointing at the canonical repo. Then symlink the toolchain artifacts from DECOMP_ARTIFACTS_DIR — orig/<bin>.exe + config/<bin>.symbols.json — into the clone so make diff can grade. (Gitignored in the clone, so never committed; a missing source only disables grading, with a warning.)
Discover the free set. Subtract three GitHub-side exclusion sets from the binary's function pool: VAs already solved on the upstream base tree (src/<bin>/_rosetta/FUN_<va>.cpp exists), VAs under a live claim on the upstream claims branch, and VAs whose _rosetta file is added by an open PR.
Claim each candidate by posting /claim FUN_<va> <binary> on the coordination issue, then polling the upstream claims branch (bounded by DECOMP_CLAIM_POLL_TIMEOUT_S) until the ledger shows your login owns it (active/expiring/pinned-by-PR). If another owner gets it first, the agent moves on.
Match on a per-function branch in the fork — the same run_match_loop the local worker uses, with a fork/PR brief variant: add exactly one src/<bin>/_rosetta/FUN_<va>.cpp, no YAML edit, verified GREEN by make rosetta / make diff. The agent heartbeats (re-posts /claim, an idempotent lease extend) before the long match phase.
PR-gate the branch locally (a toolchain-free check: exactly one added _rosetta file with a valid AGPL/STAMPED header, not already solved on the base ref). The base solved-set is read from the upstream base ref, not the working tree, so the just-added file doesn't self-collide as "already solved".
Open the upstream PR (decomp: match <Sym> @0x<rva>, fork → DECOMP_UPSTREAM_BRANCH). Opening the PR pins the claim upstream; reconcile.yml auto-releases it on merge. On any non-PR outcome (lost claim, blocked, gate fail) the agent best-effort releases the claim.

Parallelism in distributed mode is N independent agent processes, each its own fork / gh identity, serialised by the shared upstream claim ledger — not N worktrees of one repo (that's local mode). Run the orchestrator once per identity.

# after gh auth login + .env configured for distributed:
python orchestrator.py            # routes to the distributed agent

Output layout

output/
├── coordination.sqlite          # claim queue + tool-use log + merge log
│                                 #   (local mode; distributed uses it only
│                                 #    as a per-function context cache)
├── fork/                         # distributed mode: the fork working clone
├── transcripts/agent-N.jsonl    # per-worker tool-use timeline
├── logs/agent-N.stdout.log      # raw worker stdout
├── logs/agent-N.stderr.log      # raw worker stderr
├── claims/                      # reserved for future audit dumps
└── merges/
    └── escalated/               # JSON for conflicts the resolver can't auto-handle

Inspect what happened:

sqlite3 output/coordination.sqlite "
  SELECT outcome, COUNT(*) FROM claims
  WHERE released_at IS NOT NULL
  GROUP BY outcome;"

sqlite3 output/coordination.sqlite "
  SELECT c.symbol, c.iterations, c.outcome, m.result
  FROM claims c LEFT JOIN merges m ON m.claim_id = c.id
  ORDER BY c.claimed_at DESC LIMIT 20;"

Cost expectations

A single function match is typically 5–20 tool calls (Read asm, Read 2–3 siblings, Write cpp, Bash make .obj, Bash make diff, possibly 1–3 edit/recompile/rediff cycles). At Sonnet pricing that's ~$0.10–$0.50 per attempted function. Five workers running 8 attempts each ≈ $4–20/session. Set DECOMP_MAX_ATTEMPTS_PER_WORKER lower for a shorter test run.

Kill switches

SIGINT (Ctrl-C) the orchestrator → terminates all workers, flushes one final merge pass, closes the session row.
Kill one worker without stopping the others: kill <pid_of_agent-N>. Other agents keep going.
Re-run the merge pass only (no new claims): the orchestrator's merge loop runs every 20s. To bring a manual resolution back into the rotation after fixing a conflict, just set the row's merge_status back to pending:
```
sqlite3 output/coordination.sqlite "
  UPDATE claims SET merge_status='pending' WHERE id=<n>;"
```

Auto-resolved merge conflicts

The merge orchestrator now classifies every conflicted path against a strategy table. A merge auto-resolves iff every path has a strategy; a single un-categorised path escalates the whole merge.

Path category	Strategy	Why
`config/<bin>.yaml`	take theirs	each claim's only YAML edit is flipping its own row
`src/<bin>/_rosetta/FUN_<rva>.cpp` (add/add)	prefer hand-written over stamped	workers occasionally rewrite a sibling .cpp; both versions are GREEN by construction, so either compiles to byte-identical bytes — but hand-written wins over the auto-stamper's `// [STAMPED]` header version
`decomp-notes/blocked/<bin>/<rva>_<sym>.md`	append-merge	post-mortems are additive — preserve both attempts separated by `---`
`decomp-notes/types/<bin>/<rva>.md`	take theirs	newer type discovery overwrites older
`include/<bin>/...`, `decomp-notes/idioms/`, any other path	escalate	shared interface / curator-only edits need a human

Escalation payloads now capture per-path snapshots (ours/theirs/diff, truncated to 8 KB each) so an offline reader can act without re-running the merge.

If you broaden the classifier in code and want previously-escalated claims to get another try, run:

python -m decomp_agents.retry_escalated --all-sessions
# or via the console-script alias:
decomp-agents-retry-escalated --all-sessions

That flips every merge_status='conflict' claim back to 'pending' and archives the existing escalation JSONs under output/merges/escalated/_retried/. Re-run the orchestrator afterwards to drain them.

Limitations / future work

No human-in-the-loop merge resolution UI. Truly-unresolvable conflicts (shared headers, curator-only files) still drop a JSON; prompts/merge_resolver_system.md is the placeholder for a future LLM resolver.
Header coupling not protected. Workers are told not to edit shared headers, but nothing structurally prevents it. The first time it bites, add a pre-commit hook in meteor-decomp's .git/hooks/.
Single-binary per session. Workers only chew through one DECOMP_BINARY per orchestrator run. To grind multiple binaries, run the orchestrator multiple times.
No automatic rate-limit backoff. If the SDK hits a rate limit, workers will surface the error and release the claim with outcome=error. They keep retrying.
Subprocess-per-worker is heavier than asyncio tasks. Switched from in-process tasks for true cwd isolation and clean signal handling. Cost: ~50MB RSS per worker on top of the SDK.

Design notes

See meteor-decomp/AGENTS.md for the per-function discipline rules agents follow (one-file scope, narrow YAML edits, the 13 canonical fixes table).

The orchestrator's design mirrors jayminwest/overstory — multi-agent with worktree isolation and a coordination database — adapted for meteor-decomp's specific build/diff loop.

Sister projects

meteor-decomp — clean-room decompilation of the FFXIV 1.23b Windows client binaries that these agents grind through, producing byte-identical recompiles and validated wire-protocol ground truth.
Garlemald Server — Rust port of the FFXIV 1.23b server emulator (lobby / world / map), backed by SQLite and 1,142 Lua content scripts.
Garlemald Client — cross-platform Rust launcher that patches a 1.x install forward and drives it against a private server (macOS / Linux / Windows).
XIV 1.0 Apple Silicon Installer — one-command installer that stands up a working 1.23b client on Apple Silicon Macs.
XIV 1.0 Linux Installer — one-command installer that stands up a working 1.23b client on x86_64 Linux, across distributions.

License

AGPL-3.0-or-later, matching meteor-decomp/.