literature-review-agentlisted

Step 3 of the PaperOrchestra pipeline (arXiv:2604.05018). Execute the literature search strategy from outline.json — discover candidate papers via web search, verify them through Semantic Scholar (Levenshtein > 70 fuzzy title match, temporal cutoff, dedup by paperId), cross-corroborate against Crossref + OpenAlex to flag hallucinated citations, build a BibTeX file, and draft Introduction + Related Work using ≥90% of the verified pool. Runs in parallel with the plotting-agent. TRIGGER when the orchestrator delegates Step 3 or when the user asks to "find citations for my paper", "draft the related work", or "build the bibliography".
Ar9av/PaperOrchestra · ★ 569 · AI & Automation · score 79

Install: claude install-skill Ar9av/PaperOrchestra

# Literature Review Agent (Step 3) Faithful implementation of the Hybrid Literature Agent from PaperOrchestra (Song et al., 2026, arXiv:2604.05018, §4 Step 3, App. D.3, App. F.1 p.46). **Cost: ~20–30 LLM calls.** This is one of the two longest steps (the other is plotting). Wall-time floor is set by Semantic Scholar's 1 QPS verification limit. ## Inputs - `workspace/outline.json` — specifically `intro_related_work_plan` with the Introduction search directions and the 2-4 Related Work methodology clusters - `workspace/inputs/conference_guidelines.md` — used to derive `cutoff_date` - `workspace/inputs/idea.md`, `workspace/inputs/experimental_log.md` — for framing the Intro and grounding the Related Work positioning ## Outputs - `workspace/citation_pool.json` — verified Semantic Scholar metadata for every paper that survived verification - `workspace/refs.bib` — BibTeX file generated from the verified pool - `workspace/drafts/intro_relwork.tex` — drafted Introduction and Related Work sections, written into the template, with the rest of the template preserved verbatim ## Two-phase pipeline (App. D.3) ``` PHASE 1 — Parallel Candidate Discovery For each search direction in introduction_strategy.search_directions: For each limitation_search_query in each related_work cluster: - Use the host's web search tool to discover up to ~10 candidate papers. - Run up to 10 discovery queries in parallel (host-permitting). - Collect (title, snippet, url)