product-spec-pdf-parser

Solid

Extract structured FF&E product specs from PDF files — price books, fact sheets, and spec sheets. Claude reads extracted text and structures products into a standardized schedule.

Data & Documents 165 stars 36 forks Updated 3 weeks ago MIT

Install

View on GitHub

Quality Score: 89/100

Stars 20%
74
Recency 20%
90
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# /product-spec-pdf-parser — PDF Product Spec Parser Extract structured FF&E data from product PDF files — price books, fact sheets, configurator sheets, and spec sheets. Uses PyMuPDF for text extraction and Claude's reasoning to parse wildly varying PDF layouts into a standardized schedule. ## Input The user provides PDFs in one of these ways: 1. **File paths** — one or more PDF file paths 2. **Folder path** — a directory containing PDFs (will process all `.pdf` files) 3. **Just invoked** — ask the user for file paths or a folder Also ask (or use defaults): - **Output destination** — Google Sheet, local CSV, or markdown (default: ask) - **Variant depth** — `expand` (one row per variant/SKU, default) or `summarize` (comma-separated variants in one row) ## Output Schema Products are written to the **master Google Sheet** — the same 33-column schema used by all product skills, plus PDF-specific extra columns. When writing to CSV, use the same column order. Read `../../schema/product-schema.md` (relative to this SKILL.md) for the full column reference, field formats, and category vocabulary. Read `../../schema/sheet-conventions.md` for CRUD patterns with MCP tools. Skill-specific column values: - **AG (Source):** `pdf-parser` - **AF (Status):** `saved` - **J (Link):** Blank (no URL for PDFs) - **D (Thumbnail):** Blank (no image URL typically) - **C (Vendor):** Blank (source is PDF, not a retailer) - **V (Sale Price):** Blank (PDFs don't have sale prices) - **AC (Image ...

Details

Author
AlpacaLabsLLC
Repository
AlpacaLabsLLC/skills-for-architects
Created
3 months ago
Last Updated
3 weeks ago
Language
HTML
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category