dyno-debuglisted

Debug failed Dynobox runs from the public dynobox CLI package, especially matrix runs where the same scenario behaves differently across harnesses, models, or iterations. Use this skill when investigating dynobox run failures, flaky agent behavior, failed assertions, missing tool calls, debug artifacts, work directories, transcripts, or when deciding whether to rerun individual jobs versus preserving the original matrix result.
dynobox/dynobox · ★ 0 · AI & Automation · score 68

Install: claude install-skill dynobox/dynobox

# Dyno Debug This skill investigates Dynobox failures without losing the original run signal. Assume `dynobox` is the installed public CLI package unless the user explicitly says they are debugging the local Dynobox repository. Treat a matrix run as evidence: rerunning one failed job can be useful, but it can also erase the cross-harness behavior that made the failure important. ## First Response Before rerunning anything, capture the original result: ```bash dynobox run <path> --debug ``` If the user already provided output, summarize: - scenario, harness, model, iteration, and status - failed assertions and expected order/content - observed tool commands - work directories and debug artifact paths, if present - which jobs passed in the same matrix Call out cross-harness differences explicitly. For example, "Claude passed and Codex failed in the same run" is usually more important than the isolated assertion failure. ## Preserve Matrix Context Do not treat isolated reruns as a replacement for the original matrix result. Record both: - original matrix outcome - isolated rerun outcome If an isolated rerun passes after a matrix failure, report it as nondeterminism or run-context sensitivity unless evidence points to a deterministic bug. Useful language: ```text The rerun passed, but that does not invalidate the original matrix failure. The original run captured a same-scenario, cross-harness difference that should be preserved as a first-class debugging artifact.