harness-audit


Score a project's agent harness across 5 subsystems (Instructions / State / Verification / Scope / Lifecycle), identify the bottleneck, and produce a prioritized improvement plan. Use when assessing if a project is ready to graduate to [LONG-RUN] status, when an agent keeps failing despite good models, or when adopting our stack on a new codebase. Do NOT use to design or build a new harness from scratch — this only scores an existing one; for greenfield harness/agent architecture use harness-design (or agent-harness-design).

AI & Automation 138 stars 20 forks Updated today MIT


Quality Score: 89/100

Component weights: Stars 20% · Recency 20% · Frontmatter 20% · Documentation 15% · Issue Health 10% · License 10% · Description 5%

Skill Content

# Harness Audit

Score a project's agent harness across five subsystems and tell the user which one to fix first.

**Source**: Five-subsystem framework adapted from [Learn Harness Engineering](https://walkinglabs.github.io/learn-harness-engineering/) (walkinglabs, MIT). Adapted to our concrete stack: CLAUDE.md, `.claude/rules/`, PROBLEMS.md, `feature_list.json`, `init.sh`, hooks, handoffs, chronicles.

## What This Skill Does

Given a project directory, produces a scorecard like this:

```
=== Harness Audit: project-xyz ===

Instructions    4/5
  ✓ CLAUDE.md present, modular rules in .claude/rules/
  ✗ No project-level REVIEW.md for PR review guidance

State           2/5
  ✓ .claude/handoffs/ exists (3 files)
  ✗ No PROBLEMS.md - issues scattered in handoffs
  ✗ No feature_list.json - scope state not machine-readable

Verification    3/5
  ✓ Tests run, pytest configured
  ✗ No init.sh - new sessions take 15+ min to bootstrap
  ✗ 3-layer gate not documented in CLAUDE.md

Scope           3/5
  ✓ no-pre-existing-evasion principle in CLAUDE.md
  ✗ No WIP=1 (no feature_list.json to enforce it)
  ✗ Definition of Done not explicit

Lifecycle       2/5
  ✗ No SessionStart hook (no .claude/settings.json)
  ✗ No Stop hook for clean-state check
  ~ Manual cleanup convention exists but not enforced

Bottleneck: State (2/5) - lack of structured progress tr...
```
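As a rough illustration of the bottleneck step, here is a minimal Python sketch. It is not the skill's implementation, which is not shown in this listing. The artifact-to-subsystem mapping is inferred from the example scorecard, and the function names (`missing_artifacts`, `find_bottleneck`) and the tie-breaking rule are assumptions made for the example.

```python
from pathlib import Path

# Subsystem order as listed in the skill description; used here to break ties.
SUBSYSTEMS = ["Instructions", "State", "Verification", "Scope", "Lifecycle"]

# Hypothetical mapping from artifact path to subsystem, inferred from the
# example scorecard. The skill's real checks are not shown in the source.
ARTIFACTS = {
    "CLAUDE.md": "Instructions",
    ".claude/rules": "Instructions",
    "PROBLEMS.md": "State",
    "feature_list.json": "State",
    ".claude/handoffs": "State",
    "init.sh": "Verification",
    ".claude/settings.json": "Lifecycle",
}


def missing_artifacts(root):
    """Return {subsystem: [missing paths]} for artifacts absent under root."""
    missing = {}
    for rel_path, subsystem in ARTIFACTS.items():
        if not (Path(root) / rel_path).exists():
            missing.setdefault(subsystem, []).append(rel_path)
    return missing


def find_bottleneck(scores):
    """Return (name, score) of the lowest-scoring subsystem.

    Ties go to the subsystem listed first in SUBSYSTEMS (an assumption).
    """
    name = min(SUBSYSTEMS, key=lambda s: scores[s])
    return name, scores[name]


# Scores copied from the example scorecard for project-xyz.
example_scores = {
    "Instructions": 4,
    "State": 2,
    "Verification": 3,
    "Scope": 3,
    "Lifecycle": 2,
}
print(find_bottleneck(example_scores))  # ('State', 2)
```

With the example scores, State and Lifecycle tie at 2/5. The sketch picks State only because it comes first in the subsystem order, and the real skill may weigh ties differently.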

Details

Author: AnastasiyaW
Repository: AnastasiyaW/claude-code-config
Created: 4 months ago
Last Updated: today
Language: Python
License: MIT

Integrates with

pytest · Testing

Bundled in these plugins

claude-code-config
