doc-pipeline

Solid

Chain document operations into reusable pipelines

Data & Documents 364 stars 68 forks Updated today MIT

Install

Quality Score: 94/100

Stars 20%

85

Recency 20%

100

Frontmatter 20%

70

Documentation 15%

100

Issue Health 10%

80

License 10%

100

Description 5%

100

Skill Content

# Doc Pipeline Skill ## Overview This skill enables building document processing pipelines - chain multiple operations (extract, transform, convert) into reusable workflows with data flowing between stages. ## How to Use 1. Describe what you want to accomplish 2. Provide any required input data or files 3. I'll execute the appropriate operations **Example prompts:** - "PDF → Extract Text → Translate → Generate DOCX" - "Image → OCR → Summarize → Create Report" - "Excel → Analyze → Generate Charts → Create PPT" - "Multiple inputs → Merge → Format → Output" ## Domain Knowledge ### Pipeline Architecture ``` Stage 1 Stage 2 Stage 3 Stage 4 ┌──────┐ ┌──────┐ ┌──────┐ ┌──────┐ │Extract│ → │Transform│ → │ AI │ → │Output│ │ PDF │ │ Data │ │Analyze│ │ DOCX │ └──────┘ └──────┘ └──────┘ └──────┘ │ │ │ │ └───────────┴───────────┴───────────┘ Data Flow ``` ### Pipeline DSL (Domain Specific Language) ```yaml # pipeline.yaml name: contract-review-pipeline description: Extract, analyze, and report on contracts stages: - name: extract operation: pdf-extraction input: $input_file output: $extracted_text - name: analyze operation: ai-analyze input: $extracted_text prompt: "Review this contract for risks..." output: $analysis - name: report operation: docx-generation input: $analysis template: templates/review_report.docx ou...

Details

Author: majiayu000
Repository: majiayu000/claude-skill-registry
Created: 5 months ago
Last Updated: today
Language: HTML
License: MIT

Similar Skills

Semantically similar based on skill content — not just same category

Data & Documents Listed

pipeline-design

Design ETL/ELT pipelines end-to-end — source connectors, extraction strategies, transform logic, load patterns, idempotency, scheduling, and error handling. Use this skill whenever the user is starting a new ingestion job, planning how data moves from a source (REST API, database, file, webhook, message queue) into a data warehouse or data lake. Also trigger when the user asks about pipeline architecture, incremental vs. full loads, backfill strategies, CDC, retry logic, or orchestration choices (Airflow, Prefect, dbt). This skill should feel like pairing with a senior data engineer on day one of a new pipeline project.

0 Updated 5 days ago

AI & Automation Solid

ln-100-documents-pipeline

Creates complete project documentation system (project docs, reference, tasks, tests). Use when bootstrapping docs from scratch or regenerating all.

480 Updated 3 days ago

AI & Automation Listed

content-pipeline

Orchestration plan for a five-stage long-form content pipeline (raw capture → theme extraction → research → draft → quality gate and fan-out). Defines each stage's input, output, success criteria, and retry rules so a content-marketer agent can chain the existing writing, research, and distribution skills deterministically. Use when user asks about content pipeline, production workflow, blog production line, multichannel fan-out, editorial workflow, 콘텐츠 파이프라인, 장문 워크플로, 블로그 제작 라인, or 멀티채널 확산.

3 Updated today

Data & Documents Listed

doc-writing

Generate API documentation from a route manifest. Use when you have a list of discovered routes and need to produce markdown documentation.

748 Updated 2 days ago

Data & Documents Solid

data-pipelines

Use this skill when building data pipelines, ETL/ELT workflows, or data transformation layers. Triggers on Airflow DAG design, dbt model creation, Spark job optimization, streaming vs batch architecture decisions, data ingestion, data quality checks, pipeline orchestration, incremental loads, CDC (change data capture), schema evolution, and data warehouse modeling. Acts as a senior data engineer advisor for building reliable, scalable data infrastructure.

167 Updated today

AbsolutelySkilled