phd-deepread

Solid

Guided workflow for processing academic PDFs into structured literature notes using Text-First decision tree (PyMuPDF + Tesseract OCR) and Claude-assisted analysis. Perfect for literature review and note-taking in Obsidian.

Data & Documents 49 stars 4 forks Updated 6 days ago MIT

Install

View on GitHub

Quality Score: 86/100

Stars 20%
57
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
80
License 10%
100
Description 5%
100

Skill Content

# PhD Deep Read Workflow This skill implements a sophisticated **PhD Deep Read Workflow** that processes academic PDFs into structured literature notes for Obsidian using a hybrid decision-tree approach. The workflow combines: 1. **Text-First PDF Extraction**: Decision-tree using PyMuPDF (fast text extraction) for searchable PDFs and Tesseract OCR fallback for scanned/complex pages 2. **Structured Note Generation**: Template-driven generation of comprehensive academic literature notes 3. **Critical Thinking Canvas**: JSON Canvas files with 9 interconnected nodes for deep critical analysis 4. **Workflow Automation**: Scripts to automate the 4-stage pipeline ## When to Use This Skill Activate when the user: - Wants to process academic PDFs into structured literature notes - Needs to extract text from PDFs with complex layouts or tables - Wants to generate Obsidian-compatible notes with YAML frontmatter and Dataview callouts - Needs critical thinking canvases for deep analysis of papers - Has a collection of PDFs to process in batch ## Commands The skill provides these main commands: - `setup`: Setup environment and install required tools (PyMuPDF, Tesseract OCR) - `extract`: Extract text and images from PDFs using Text-First decision tree (PyMuPDF + Tesseract OCR) - `generate`: Generate structured literature notes using .clauderules template - `canvas`: Create JSON Canvas files for critical thinking with 9 interconnected nodes - `run`: Run full workflow automation (extra...

Details

Author
heleninsights-dot
Repository
heleninsights-dot/phd-deepread-workflow
Created
2 months ago
Last Updated
6 days ago
Language
Python
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category

Data & Documents Listed

extract-from-pdfs

This skill should be used when extracting structured data from scientific PDFs for systematic reviews, meta-analyses, or database creation. Use when working with collections of research papers that need to be converted into analyzable datasets with validation metrics.

335 Updated today
aiskillstore
AI & Automation Listed

deeppapernote

Generate a high-quality deep-reading note for a single paper and write it into an Obsidian-style vault. Use when the user gives a paper title, DOI, URL, arXiv ID, Zotero item, or local PDF and wants a polished Markdown note with strong structure, evidence-based analysis, and figure placeholders.

2 Updated today
opencue
AI & Automation Solid

obsidian-literature-workflow

Use this skill when the user keeps paper notes inside an Obsidian project knowledge base and wants filesystem-first literature review, explicit agent-first Zotero ingestion, `Papers/` plus `Knowledge/` synthesis, collection-wide normalization, and a default literature canvas without Obsidian MCP.

4,135 Updated today
Galaxy-Dawn
Code & Development Listed

deep-review

Expert review workflow for papers, PDFs, arXiv links, markdown drafts, and technical artifacts. Use when the user wants a rigorous review that interviews them first, researches claims, checks math/methods/literature, uses specialist passes when useful, and returns severity-ranked findings with evidence.

0 Updated 1 weeks ago
dipakkrishnan
Data & Documents Listed

pdf-page-extract

Extract rich data from PDF pages including text spans with metadata, rendered PNG images, and page mapping. Creates persistent artifacts for downstream processing.

335 Updated today
aiskillstore