local-vaultlisted
Install: claude install-skill genli-ai/market-research-skills
# local-vault
Turn a folder of raw files into a **Markdown vault** that an LLM can grep, and
then answer questions over that vault responsibly.
**Mental model:** `SOURCE` = raw files (source of truth). `VAULT` = one `.md` per
source file, carrying retrieval frontmatter (abstract / tags / synonyms) + a
`source` backlink. The vault is the layer the LLM reads; the raw files are where
the user goes to verify.
There are **two distinct jobs** — figure out which the user wants:
- **A. Convert / sync** — they dropped files in and want them in the vault → run
the pipeline (`scripts/sync.py`).
- **B. Retrieve / answer** — they want answers from an existing vault → follow
the *Retrieval & feedback protocol* below. Do **not** run the pipeline for this.
---
## A. Convert / sync
### One-time setup (do this for the user if not already done)
1. **Python deps** (user-level, no venv):
```
python3 -m pip install --user requests python-dotenv pypdf pymupdf4llm openpyxl python-pptx
```
2. **pandoc** (for docx/rtf/odt/epub): `brew install pandoc` (macOS) / distro pkg.
3. **ffmpeg** (only for audio/video transcription): `brew install ffmpeg` (macOS) /
distro pkg. The **whisper engine is auto-selected by platform** — `mlx-whisper`
on Apple Silicon (GPU), `faster-whisper` elsewhere (cross-platform CPU/CUDA) — and
**auto-installed after the user consents** at the first-run prompt (no manual pip
needed). On that first run with audio/video present, the tool shows the mod