verification-looplisted
Install: claude install-skill fredhead88/do-it
# Verification Loop
## Overview
**Observe the running product. Stay blind to how it was built.**
The verification loop is a standing autonomous reviewer that drives shipped work from
"orc says done" to "verified green on prod". It observes the deployed product via a
headless browser, assigns typed evidence to each acceptance criterion, judges
cross-vendor (Codex primary, Claude fallback), and loops to convergence — filing
correctives for hollow work, escalating taste/blockers, and never touching the build.
## Three Core Invariants
1. **Blind-but-watching.** The verifier never sees the build, the diff, or the
builder's reasoning. The judge receives only the typed evidence artifact — never
the worker's explanation.
2. **Evidence-type-locked-to-criterion-type.** A UI criterion requires a
DOM/screenshot observation. A grep is auto-fail for a UI criterion. No criterion
closes without observed, type-matched evidence.
3. **Verifier owns the verdict; the builder cannot overwrite it.** Verdicts live in
`~/.claude/ledger/verified/<spec_id>.yml` — a separate namespace the builder's
`set`/`register` commands never touch.
## The Two-Body Warning
The loop converges **only if orc is running and consuming correctives**. If filed
items sit unconsumed for N ticks, escalate to `NEEDS-HUMAN.jsonl` — do not file
forever into the void.
## The 8-Step Tick
Run with: `node /path/to/verification-loop/tick.mjs --config <project> [--spec NNN-slug] [--dry-run] [--force]`
1. **