plan-swarm-review

Iterative plan review using multisampling + focused decomposition. Launches parallel independent agents to find issues that single-pass review misses. 4 escalating rounds: broad -> multisample -> focused -> focused+multisample. Use when: "swarm review", "review plan thoroughly", "multisample review", "deep plan review", "plan swarming", "stress test the plan", or before implementing any plan >500 lines or with >3 interacting components. Also use proactively when a large plan is about to be implemented — catch issues before code, not after.

AI & Automation · 126 stars · 19 forks · Updated 2 days ago · MIT

Quality Score: 92/100

Stars (weight 20%): 70
Recency (weight 20%): 100
Frontmatter (weight 20%): 70
Documentation (weight 15%): 100
Issue Health (weight 10%): 80
License (weight 10%): 100
Description (weight 5%): 100

Skill Content

# Plan Swarm Review

Iterative plan hardening through multisampling and focused decomposition.

**Core insight**: a single agent misses issues due to attention budget limits. Multiple independent agents reading the same document find different problems (stochastic diversity). Focused decomposition further improves depth per aspect. Iterative fix-then-re-review uncovers issues previously masked by other bugs.

Source: deksden (@deksden_notes), "Plan Swarming" technique, April 2026.

Related: Anthropic Harness Design (Generator-Evaluator), deep-review (parallel competency code review).

Research backing:

- [2502.11027] Sampling diversity in LLM inference: diverse prompts beat identical ones (+10.8% reasoning, +9.5% code)
- [2602.09341] AgentAuditor: reasoning tree audit beats majority voting, recovers 65-82% of minority-correct findings
- [2602.17875] MultiVer: 4 parallel agents hit 82.7% recall on vulnerability detection (beats fine-tuned models)
- [2510.00317] MAVUL: multi-agent vuln detection, +600% vs single-agent
- Anthropic Code Review (Mar 2026): parallel agents raise substantive findings from 16% to 54%

## Modes

This skill works in two modes:

**Plan mode** (default): review design docs, specs, ADRs, RFCs before implementation.

**Code mode**: review code files for bugs and vulnerabilities. Activated when user passes code files instead of a plan, or says "review code", "find vulnerabilities", "security audit". In code mode, aspects shift from plan-oriented (contracts, c...
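The core insight stated earlier (independent agents reading the same document surface different problems) can be illustrated with a small Python sketch. This is not the skill's actual implementation: `multisample_review`, the `Reviewer` signature, and `fake_reviewer` with its canned findings are all hypothetical names that stand in for real agent calls. The sketch assumes findings are merged by union rather than majority vote, so an issue raised by only one reviewer is still kept.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from typing import Callable

# Hypothetical reviewer signature: (plan text, sample index) -> set of findings.
Reviewer = Callable[[str, int], set[str]]


def multisample_review(plan: str, reviewer: Reviewer, samples: int = 4) -> Counter:
    """Run `samples` independent reviews in parallel and merge findings by union.

    The returned Counter maps each finding to the number of reviewers that
    reported it. Findings reported by a single reviewer are kept, not voted out.
    """
    with ThreadPoolExecutor(max_workers=samples) as pool:
        results = list(pool.map(lambda i: reviewer(plan, i), range(samples)))

    counts: Counter = Counter()
    for findings in results:
        counts.update(findings)
    return counts


def fake_reviewer(plan: str, sample_index: int) -> set[str]:
    """Stand-in for an agent call (assumption): returns canned, differing findings."""
    canned = [
        {"missing rollback step"},
        {"missing rollback step", "undefined retry policy"},
        {"cache invalidation race"},
        set(),
    ]
    return canned[sample_index % len(canned)]


if __name__ == "__main__":
    merged = multisample_review("example plan text", fake_reviewer, samples=4)
    for finding, seen_by in merged.most_common():
        print(f"{seen_by} reviewer(s): {finding}")
```

Keeping the per-finding count alongside the union lets a later step prioritize issues seen by several reviewers without discarding those seen by only one.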

Details

Author: AnastasiyaW
Repository: AnastasiyaW/claude-code-config
Created: 2 months ago
Last Updated: 2 days ago
Language: Python
License: MIT
