
parakeet

Local speech-to-text via Handy app (push-to-talk) and NeMo CLI scripts. Parakeet V3: 25 languages, auto-detection, ~30x realtime on M4 Max, 6% WER. This skill should be used when transcribing audio files or dictating voice input.
tdimino/claude-code-minoan · ★ 32 · Data & Documents · score 85
Install: claude install-skill tdimino/claude-code-minoan
# Parakeet Dictation Skill

Local speech-to-text powered by NVIDIA Parakeet TDT 0.6B V3 (~600MB model, 100% offline).

## Two Modes

### 1. Handy App (Primary: Push-to-Talk into Any Text Field)

[Handy](https://handy.computer/) is a free, open-source Tauri app (Rust + React) providing push-to-talk dictation with Parakeet V3 built in. Inference via [transcribe-rs](https://github.com/cjpais/transcribe-rs) (ONNX Runtime, int8 quantized).

```bash
brew install --cask handy
```

- **Default hotkey**: ⌥Space (Option-Space) on macOS, Ctrl-Space on Windows/Linux
- **Modes**: Push-to-talk (hold) or toggle (press to start/stop)
- Select **Parakeet V3** in Settings → Models (auto-downloads ~478MB)
- Grant microphone + accessibility permissions
- Includes VAD (Silero), model management UI
- **Additional models**: Whisper (Small/Medium/Turbo/Large), Moonshine, SenseVoice
- Models stored at `~/Library/Application Support/com.pais.handy/models/`

### 2. CLI Scripts (Claude Code File Transcription & Terminal Dictation)

CLI scripts remain for headless/terminal use within Claude Code. These use NeMo/PyTorch.

## Performance

| System | Speed | Engine |
|--------|-------|--------|
| **Handy (M4 Max)** | ~30x realtime | transcribe-rs / ONNX int8 |
| **Handy (Zen 3)** | ~20x realtime | transcribe-rs / ONNX int8 |
| **Handy (Skylake i5)** | ~5x realtime | transcribe-rs / ONNX int8 |
| **NeMo CLI (MPS)** | Varies | NeMo / PyTorch |

- **Accuracy**: 6.05% WER (Word Error Rate)
- **Languages**: 25
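
To make the performance figures concrete, here is a minimal Python sketch that turns the approximate realtime factors and the 6.05% WER quoted above into rough estimates. The names `REALTIME_FACTOR`, `estimated_seconds`, and `expected_word_errors` are hypothetical helpers written for this illustration and are not part of Handy or the CLI scripts. It assumes a realtime factor of N means one second of compute covers N seconds of audio, and that WER is the fraction of reference words affected by an error. Real throughput and accuracy will vary with audio quality and language.

```python
# Approximate realtime factors copied from the Performance figures above.
# The NeMo CLI row is omitted because its speed is listed as "Varies".
REALTIME_FACTOR = {
    "Handy (M4 Max)": 30.0,
    "Handy (Zen 3)": 20.0,
    "Handy (Skylake i5)": 5.0,
}

# Reported word error rate (6.05%), expressed as a fraction.
REPORTED_WER = 0.0605


def estimated_seconds(audio_seconds: float, system: str) -> float:
    """Rough wall-clock time to transcribe `audio_seconds` of audio.

    Assumes an Nx realtime factor means N seconds of audio are
    processed per second of compute.
    """
    return audio_seconds / REALTIME_FACTOR[system]


def expected_word_errors(word_count: int, wer: float = REPORTED_WER) -> float:
    """Rough number of word-level errors in a transcript of `word_count` words.

    WER = (substitutions + deletions + insertions) / reference words,
    so the expected error count is approximately WER * word_count.
    """
    return wer * word_count


if __name__ == "__main__":
    one_hour = 60 * 60  # seconds of audio
    for system in REALTIME_FACTOR:
        minutes = estimated_seconds(one_hour, system) / 60
        print(f"{system}: about {minutes:.0f} min per hour of audio")
    print(f"Expected errors per 1,000 words: about {expected_word_errors(1000):.1f}")
```

Under these assumptions, a one-hour recording takes roughly 2 minutes on the M4 Max, 3 minutes on Zen 3, and 12 minutes on the Skylake i5, with around 60 word-level errors per 1,000 words.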