fetch-sitemap

Solid

Extract URLs from an XML sitemap with optional regex filtering

AI & Automation · 48 stars · 5 forks · Updated 1 week ago · MIT

Quality Score: 87/100

Stars 20%
Recency 20%
Frontmatter 20%
Documentation 15%: 100
Issue Health 10%
License 10%: 100
Description 5%: 100

Skill Content

# Fetch Sitemap URLs

Extract URLs from an XML sitemap with optional regex filtering.

## Arguments

- `$0`: URL (required, must start with `http://` or `https://`)
  - If the URL ends with `.xml`, use it directly as the sitemap URL (backward compatible)
  - Otherwise, run the auto-discovery logic below
- `$1`: an extended regex pattern for filtering (optional)

If `$0` is empty, display the usage below and stop:

```
Usage: /fetch-sitemap <url> [pattern]

Examples:
  /fetch-sitemap https://kotlinlang.org/docs
  /fetch-sitemap https://example.com/sitemap.xml
  /fetch-sitemap https://example.com docs
  /fetch-sitemap https://example.com/sitemap.xml 'skills|hooks'
```

If `$0` does not start with `http://` or `https://`, inform the user that a valid URL is required and stop.

## Sitemap Auto-Discovery

When the URL does **not** end with `.xml`, automatically discover the sitemap by probing the following locations **one at a time, stopping as soon as one produces output** (do NOT run probes in parallel):

**Probes 1–2** (fetch and extract in a single curl):

1. `{url}/sitemap.xml`: path-specific (e.g., `https://kotlinlang.org/docs/sitemap.xml`)
2. `{origin}/sitemap.xml`: site root (e.g., `https://kotlinlang.org/sitemap.xml`), where `{origin}` is the scheme + host of the URL

```bash
curl -sfL --compressed --connect-timeout 5 --max-time 10 <probe-url> | grep -oE '<loc>[^<]+</loc>' | sed 's/<loc>//;s/<\/loc>//'
```

If the output is non-empty, the sitemap is found **and the URLs...
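As a rough illustration of the argument check and the first two probes described above, the following sketch derives the probe URLs from an input URL and extracts `<loc>` values from sitemap XML read on stdin. The function names `probe_urls` and `extract_locs` are hypothetical and do not come from the skill. Applying the optional pattern with `grep -E` and skipping a duplicate second probe are both assumptions, since the excerpt is truncated before it shows how filtering is done. The origin parsing is deliberately simple and assumes the URL has no query string directly after the host.

```bash
# Hypothetical helpers: a sketch under the assumptions stated above,
# not the skill's actual implementation.

# Print the sitemap URLs to try for an input URL, one per line, in probe order.
probe_urls() {
  url="${1:-}"
  url="${url%/}"                      # tolerate one trailing slash
  case "$url" in
    http://*|https://*) ;;
    *) echo "error: URL must start with http:// or https://" >&2; return 1 ;;
  esac
  case "$url" in
    *.xml) printf '%s\n' "$url"; return 0 ;;   # already a sitemap URL
  esac
  scheme="${url%%://*}"
  rest="${url#*://}"
  origin="${scheme}://${rest%%/*}"    # scheme + host
  printf '%s\n' "${url}/sitemap.xml"  # probe 1: path-specific
  # Assumption: skip probe 2 when it would repeat probe 1.
  if [ "$origin" != "$url" ]; then
    printf '%s\n' "${origin}/sitemap.xml"   # probe 2: site root
  fi
}

# Read sitemap XML on stdin and print each <loc> value on its own line.
# An optional extended regex (assumed to be applied with grep -E) filters the list.
extract_locs() {
  pattern="${1:-}"
  grep -oE '<loc>[^<]+</loc>' | sed 's/<loc>//;s/<\/loc>//' |
    if [ -n "$pattern" ]; then grep -E -- "$pattern"; else cat; fi
}
```

Each line printed by `probe_urls` could be passed in turn to the curl command shown earlier, with the response piped into `extract_locs`.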

Details

Author: LeeJuOh
Repository: LeeJuOh/claude-code-zero
Created: 6 months ago
Last Updated: 1 week ago
Language: JavaScript
License: MIT

Integrates with

Anthropic · AI

Bundled in these plugins

claude-code-zero

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Featured

sitemap-audit

XML sitemap audit — find and fix the sitemap problems that quietly waste crawl budget and slow indexing. Discovers the sitemap (robots.txt, /sitemap.xml, sitemap index), validates structure and size limits, and cross-checks the URLs it lists against reality: non-200 / redirected / noindex / canonicalized-away URLs that shouldn't be in a sitemap, plus indexable pages that are missing from it. Reviews lastmod accuracy, sitemap-index organization, and robots.txt reference. Use this skill whenever the user asks about sitemaps, sitemap errors in Search Console, "sitemap couldn't fetch / has errors", crawl budget, pages not getting indexed, or whether their sitemap is clean. Trigger on: "sitemap", "sitemap.xml", "XML sitemap", "sitemap errors", "sitemap audit", "couldn't fetch sitemap", "crawl budget", "pages not indexed sitemap", "sitemap index", "lastmod", "robots.txt sitemap", or any sitemap/crawl-coverage question. For a full-site SEO audit use /seo-analysis; for broken links use /broken-link-checker.

3,289 Updated 5 days ago

nowork-studio

Data & Documents Listed

seo-sitemap

Analyze existing XML sitemaps or generate new ones with industry templates. Validates format, URLs, and structure. Use when user says "sitemap", "generate sitemap", "sitemap issues", or "XML sitemap".

1 Updated 2 days ago

lasri97

Web & Frontend Solid

sitemapkit

Discover and extract sitemaps from any website using SitemapKit. Use this skill whenever the user wants to find pages on a website, get a list of URLs from a domain, audit a site's structure, crawl a sitemap, check what pages exist on a site, or do anything involving sitemaps or site URL discovery — even if they don't explicitly say "sitemap". Requires the sitemapkit MCP server configured with a valid SITEMAPKIT_API_KEY.

402 Updated today

aiskillstore