troubleshoot

Diagnose pipeline issues by reading logs and state across Airbyte, dbt, BigQuery, the VPS, and Tailscale. Invoke when verify-pipeline reports a failure, the user says 'something's broken', or a sync hasn't run.
pol-cc/agentic-data-engineer · ★ 1 · Data & Documents · score 72
Install: claude install-skill pol-cc/agentic-data-engineer
# troubleshoot

> **Status**: v0.10.0. References written; diagnostic playbook operational. Now covers the **dlt-era ingest failure modes**: the silent data gap (a mis-set incremental cursor that skips rows without crashing), a partial load (`_dlt_loads` status non-zero), a source-vs-destination reconciliation mismatch, and a systemd timer that didn't fire, alongside the original Airbyte/dbt/BQ/MCP modes.

## What this skill does

Walks through the most common failure modes of the MDS in order of likelihood, gathering evidence from each layer. The agent reasons over the evidence to identify the cause and proposes (but does not execute) a fix. The user confirms before any change is applied.

## Preflight

```bash
if [ ! -f .agentic-data-engineer.json ]; then
  echo "[abort] not a managed MDS deployment"
  exit 1
fi
```

## Standard diagnostic flow

Run checks in this order; earlier failures often explain later ones:

**1. Tailscale reachability**

```bash
ssh -i ~/.ssh/deploy_<client> root@<vps-tailscale-hostname> "tailscale status"
```

If unreachable: the VPS is offline, Tailscale on the VPS is down, or Tailscale on the laptop is down.

**2. VPS processes and timers**

```bash
ssh ... "uptime && free -h && df -h /"   # load, RAM, disk
ssh ... "systemctl list-timers 'dlt-*' 'dbt-*' --all --no-pager"   # default: dlt + dbt timers (NEXT/LAST)
ssh ... "journalctl -u dlt-<source>.service -u dbt-run.service --since '2 days ago' --no-pager | tail -40"
ssh .