← ClaudeAtlas

pdf-to-word-docxlisted

PDF conversion toolkit featuring AI layout analysis and OCR. Converts PDFs to Word/Docx, Markdown, JSON, PPT, CSV, HTML, and XML for seamless LLM data processing.
ComPDFKit/compdf-skills · ★ 100 · Data & Documents · score 79
Install: claude install-skill ComPDFKit/compdf-skills
# pdf to word ## Purpose - Wraps the `ComPDFKitConversion` Python SDK into a reusable local conversion workflow, supporting PDF / image to Word, PPT, Excel, HTML, RTF, Image, TXT, JSON, Markdown, and CSV (10 output formats in total). ## Agent Skills Standard Compatibility - This Skill uses an Anthropic Agent Skills-compatible directory structure: `pdf-to-word-docx/`. - The entry point is `SKILL.md`; helper scripts are placed in `scripts/`. - The document uses `$ARGUMENTS` and `${CLAUDE_SKILL_DIR}` conventions for distribution and execution in Claude Code / Agent Skills-compatible environments. ## Input / Output - Input: The target format (`word`/`excel`/`ppt`/`html`/`rtf`/`image`/`txt`/`json`/`markdown`/`csv`), the PDF or image path, and the output path are passed via Skill arguments or the command line. An optional PDF password and conversion parameters may also be provided. - Supported input file types: - PDF files (`.pdf`) - Image files (`.jpg`/`.jpeg`/`.png`/`.bmp`/`.tif`/`.tiff`/`.webp`/`.jp2`/`.gif`/`.tga`) - Output: A file in the corresponding format (`.docx`, `.pptx`, `.xlsx`, `.html`, `.rtf`, image, `.txt`, `.json`, `.md`, `.csv`), or a clear error message. ## Prerequisites - Supports Windows and macOS. - The conversion SDK must be installed first: ```bash pip install ComPDFKitConversion ``` - On first run, the script automatically downloads `license.xml` from the ComPDF server and caches it in the `scripts/` directory: ```text https://download.comp