pdf-cascadelisted
Install: claude install-skill roomi-fields/paper-trail
# Skill : pdf-cascade
## Purpose
Wraps the paper-trail worker B's acquisition cascade. Given a single
reference slug or a state filter, it advances the matching refs from
`candidate` toward `page1_validated` through the FSM, with strict page 1
anti-homonymy validation.
Anchors all downloads in the local registry (`pdf_path`, `pdf_sha256`,
`acquisition_attempts[]`) so the curator can audit everything.
## When to invoke
Trigger this skill for any of:
- The user wants to fetch a PDF for a ref by slug
- The user wants to push the whole batch of `candidate` or `uid_resolved`
refs forward
- The user explicitly calls `/paper-trail:cascade`,
`/paper-trail:reactivate-ocr`, or `/paper-trail:status`
- `sota-writer` sub-task needs PDFs acquired for its proposed candidates
Do NOT invoke for semantic decisions (is this citation correct?) — that
belongs to `sota-auditor`.
## How it works
The skill delegates to the worker B Python CLI:
```bash
# Single ref by slug
python -m pipeline run --ref <slug>
# Batch by state filter
python -m pipeline run --state candidate --limit 50
# Dry-run (no mutation)
python -m pipeline run --state candidate --dry-run
# Reactivate refs waiting for OCR
python -m pipeline reactivate-ocr
```
The CLI invokes the 8-source cascade (or 10 sources if
`RESEARCH_ENABLE_SHADOW_LIBS=1` — see DISCLAIMER.md). Each acquired
PDF must pass page 1 validation (author + title similarity ≥ 0.3 +
zero off-domain keywords) before being accepted into the registry.
##