hypothesis-testing

Solid

Applies the scientific method to debugging by helping users form specific, testable hypotheses, design targeted experiments, and systematically confirm or reject theories to find root causes. Use when a user says their code isn't working, they're getting an error, something broke, they want to troubleshoot a bug, or they're trying to figure out what's causing an issue. Concrete actions include isolating failing components, forming and testing hypotheses, analyzing error messages, tracing execution paths, and interpreting test results to narrow down root causes.

Testing & QA · 1,177 stars · 108 forks · Updated today · Apache-2.0

Quality Score: 99/100

Stars (20%): 100
Recency (20%): 100
Frontmatter (20%): 70
Documentation (15%): 100
Issue Health (10%): 50
License (10%): 100
Description (5%): 100

Skill Content

# Hypothesis-Driven Debugging

You are applying the scientific method to debugging. Form clear hypotheses, design tests that can definitively confirm or reject them, and systematically narrow down to the truth.

## Core Principle

**Every debugging action should test a specific hypothesis. Random changes are not debugging.**

## The Scientific Debugging Method

### 1. Observe - Gather Facts

Before forming hypotheses, collect observations:

- What exactly happens? (specific symptoms)
- When does it happen? (timing, frequency)
- Where does it happen? (environment, component)
- What changed recently? (code, config, data)

**Write down observations objectively:**

```
Observations:
- API returns 500 error on POST /orders
- Happens only when cart has > 10 items
- Started after deployment on 2024-01-15
- Works fine in staging environment
- Error logs show "connection refused" to inventory service
```

### 2. Hypothesize - Form Testable Theories

**Examples (bad → good):**

- ~~"Something is wrong with the network"~~ → "The inventory service connection pool is exhausted when processing orders with >10 items"
- ~~"There might be a race condition"~~ → "The order processing timeout (5s) is insufficient for large orders"

### 3. Predict - Define Expected Results

For each hypothesis, define what you expect to observe if it is true versus false:

```
Hypothesis: Connection pool exhausted for large orders

If TRUE:
- Active connections should hit max (20) during large orders
- Small orders...
```
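The hypothesize and predict steps above could be captured in code roughly as follows. This is a minimal sketch, not part of the skill itself: the `Hypothesis` record, the `evaluate` helper, and the observation keys `peak_connections` and `error_reproduced` are all hypothetical names chosen for illustration. The pool size of 20 comes from the example above; the "if false" prediction is an assumption, since the preview is cut off before stating it.

```python
from dataclasses import dataclass
from typing import Callable, Dict

# A measurement taken during one experiment run (hypothetical shape).
Observation = Dict[str, object]


@dataclass
class Hypothesis:
    statement: str
    # Holds on the measurements if the hypothesis is true.
    predicted_if_true: Callable[[Observation], bool]
    # Holds on the measurements if the hypothesis is false.
    predicted_if_false: Callable[[Observation], bool]


def evaluate(h: Hypothesis, obs: Observation) -> str:
    """Compare an observation against both predictions."""
    matches_true = h.predicted_if_true(obs)
    matches_false = h.predicted_if_false(obs)
    if matches_true and not matches_false:
        return "supported"
    if matches_false and not matches_true:
        return "rejected"
    # Neither (or both) predictions matched: the experiment did not discriminate.
    return "inconclusive"


POOL_MAX = 20  # max connections, taken from the example above

pool_exhausted = Hypothesis(
    statement="Connection pool exhausted for large orders",
    # If true: the pool hits its maximum while the error is reproduced.
    predicted_if_true=lambda o: bool(o["error_reproduced"])
    and o["peak_connections"] >= POOL_MAX,
    # Assumed "if false" prediction: the error occurs with spare connections.
    predicted_if_false=lambda o: bool(o["error_reproduced"])
    and o["peak_connections"] < POOL_MAX,
)

result = evaluate(
    pool_exhausted, {"peak_connections": 20, "error_reproduced": True}
)
print(result)
```

Writing both predictions down before running the experiment means a run that fails to reproduce the error is reported as inconclusive, so it is not mistaken for evidence in either direction.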

Details

Author: rohitg00
Repository: rohitg00/skillkit
Created: 4 months ago
Last Updated: today
Language: TypeScript
License: Apache-2.0

Similar Skills

Semantically similar based on skill content — not just same category

Testing & QA Listed

debugging-testing

Use when the user reports a bug they can't reproduce, asks where to start debugging, or mentions a Heisenbug / production-only failure. Drives the observe→hypothesize→predict→test→iterate scientific method.

62 Updated today
Tibsfox

Code & Development Listed

hypothesis-debugging

Hypothesis-driven debugging protocol — reproduce, isolate, hypothesize, test, fix, document. Use when debugging a failure, a failing test, a flaky CI run, or a production error. Write a ranked hypothesis table before any fix; find the root cause, not a workaround. The operational form of the vibe-engineer-skills principle "hypothesis before help."

0 Updated 6 days ago
HermeticOrmus

Code & Development Listed

debugging

Systematic root-cause debugging: reproduce, investigate, hypothesize, fix with verification. Use when asked to "debug this", "fix this bug", "why is this failing", "troubleshoot", or mentions errors, stack traces, broken tests, flaky tests, regressions, or unexpected behavior.

20 Updated today
iliaal

AI & Automation Listed

hyper-debug

Use when encountering a test failure, exception, regression, or unexpected behavior — before proposing fixes. Also when the user invokes /hyperclaude:hyper-debug. Forms hypotheses systematically instead of guess-and-check thrashing.

3 Updated yesterday
zeikar

Code & Development Listed

systematic-debugging

4-phase root cause analysis: observe, hypothesize, verify, fix. Enforces investigation before any code changes. Emergency stop after 2 failed fixes. Prevents shotgun debugging and fix cascades.

1 Updated today
allysgrandiose674