← ClaudeAtlas

site-capture-auditlisted

Captures a website (raw HTML), cleans pages to markdown, pulls sitemaps and robots.txt, then produces a prioritized 90-day quick-win SEO audit. Use when onboarding a new site, baselining technical SEO and on-page health, or building a ranked P0/P1/P2 fix list across indexability, on-page, content, and conversion.
sujanbhuiyan/Skills · ★ 0 · AI & Automation · score 63
Install: claude install-skill sujanbhuiyan/Skills
# Site Capture & Audit ## What this does Captures a website to local files, then turns that capture into a prioritized, 90-day quick-win audit. The capture step saves raw HTML, a cleaned text/markdown version of each page, and the site's `sitemap.xml` + `robots.txt`. The audit step grades the site against a P0/P1/P2 rubric covering technical SEO (indexability, canonicals, redirects), on-page (titles, meta, headings, schema, internal links), content depth, Core Web Vitals, and conversion — and emits a ranked fix list with effort/impact. ## When to use it - Onboarding a new client/site and you need a fast, evidence-based baseline. - Auditing technical + on-page SEO before a redesign, migration, or content push. - Producing a defensible "first 90 days" quick-win list for a stakeholder. ## How to use it 1. Capture the site with the bundled script: `python scripts/capture.py https://example.com --max 25 --out ./capture` It fetches the homepage, discovers more URLs via `sitemap.xml` / `robots.txt`, saves raw HTML to `<out>/raw-html/`, cleaned markdown to `<out>/clean-md/`, and `<out>/sitemaps/` (sitemaps + robots.txt). See the script `--help`. 2. Read the cleaned pages in `<out>/clean-md/` and skim the raw HTML `<head>` blocks for titles, meta, canonical, robots directives, and JSON-LD. 3. Work through [`references/audit-checklist.md`](references/audit-checklist.md) in priority order — P0 indexability first, then P1 on-page, then P2 depth + CWV + conversi