← ClaudeAtlas

exp-designlisted

Claim-driven experiment design — scope target claims → design experiment blocks (baseline/validation/ablation/robustness) → build run order → optional Review LLM review → write to wiki
Lambenthan/empiricalwiki · ★ 45 · Web & Frontend · score 80
Install: claude install-skill Lambenthan/empiricalwiki
# /exp-design > Given an idea (or a free-text hypothesis), design a complete experiment plan. > Claims are the core: scope the claims to validate across three dimensions — Target, Decomposition, and Threats. > Design four types of experiment blocks: baseline (reproduce baseline), validation (core verification), ablation (factor isolation), and robustness (stress testing). > Experiments are ordered by dependency with decision gates between stages (sanity fail → early stop). > Optional Review LLM review checks experiment plan completeness. All experiments are written to wiki/experiments/ with graph edges. ## Inputs - `idea`: one of: - A slug from wiki/ideas/ (e.g. `sparse-lora-for-edge-devices`) - A free-text hypothesis description (provide the experiment goal directly) - `--review` (optional): enable Review LLM review to check experiment plan completeness - `--budget <gpu-hours>` (optional): total compute budget cap (GPU hours), affects robustness experiment scope ## Outputs - `wiki/experiments/{slug}.md` — one page per experiment block (status: planned) - `wiki/graph/edges.jsonl` — new tested_by edges: experiment → claim - `wiki/ideas/{slug}.md` — updated linked_experiments field - `wiki/graph/context_brief.md` — rebuilt - `wiki/graph/open_questions.md` — rebuilt - `wiki/log.md` — appended log entry - **EXPERIMENT_PLAN_REPORT** (printed to terminal) — experiment block summary, run order, compute budget ## Wiki Interaction ### Reads - `wiki/ideas/{slug}.md` — idea's