harness-audit

Solid

Score a project's agent harness across 5 subsystems (Instructions / State / Verification / Scope / Lifecycle), identify the bottleneck, and produce a prioritized improvement plan. Use when assessing if a project is ready to graduate to [LONG-RUN] status, when an agent keeps failing despite good models, or when adopting our stack on a new codebase.

AI & Automation 126 stars 19 forks Updated 2 days ago MIT

Quality Score: 89/100

Stars (20%): 70
Recency (20%): 100
Frontmatter (20%): 70
Documentation (15%): 100
Issue Health (10%): 80
License (10%): 100
Description (5%): 100

Skill Content

# Harness Audit

Score a project's agent harness across five subsystems and tell the user which one to fix first.

**Source**: Five-subsystem framework adapted from [Learn Harness Engineering](https://walkinglabs.github.io/learn-harness-engineering/) (walkinglabs, MIT). Adapted to our concrete stack: CLAUDE.md, `.claude/rules/`, PROBLEMS.md, `feature_list.json`, `init.sh`, hooks, handoffs, chronicles.

## What This Skill Does

Given a project directory, produces a scorecard like this:

```
=== Harness Audit: project-xyz ===

Instructions 4/5
  ✓ CLAUDE.md present, modular rules in .claude/rules/
  ✗ No project-level REVIEW.md for PR review guidance

State 2/5
  ✓ .claude/handoffs/ exists (3 files)
  ✗ No PROBLEMS.md - issues scattered in handoffs
  ✗ No feature_list.json - scope state not machine-readable

Verification 3/5
  ✓ Tests run, pytest configured
  ✗ No init.sh - new sessions take 15+ min to bootstrap
  ✗ 3-layer gate not documented in CLAUDE.md

Scope 3/5
  ✓ no-pre-existing-evasion principle in CLAUDE.md
  ✗ No WIP=1 (no feature_list.json to enforce it)
  ✗ Definition of Done not explicit

Lifecycle 2/5
  ✗ No SessionStart hook (no .claude/settings.json)
  ✗ No Stop hook for clean-state check
  ~ Manual cleanup convention exists but not enforced

Bottleneck: State (2/5) - lack of structured progress tr...
```
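The excerpt above is cut off before it explains how scores are computed, so the following is only a minimal sketch of two steps it implies: checking whether the named harness files exist, and picking the lowest-scoring subsystem as the bottleneck. The `ARTIFACTS` grouping, the `probe` and `bottleneck` helper names, and the tie-break rule (first subsystem listed wins) are assumptions made for illustration, not the skill's actual implementation.

```python
from pathlib import Path

# Paths named in the skill text. Grouping them under subsystems like this is
# an assumption for illustration. Scope is omitted because its checks
# (WIP=1, Definition of Done) look at file contents, not file presence.
ARTIFACTS = {
    "Instructions": ["CLAUDE.md", ".claude/rules"],
    "State": ["PROBLEMS.md", "feature_list.json", ".claude/handoffs"],
    "Verification": ["init.sh"],
    "Lifecycle": [".claude/settings.json"],
}


def probe(project_dir):
    """Return {subsystem: {relative_path: exists}} for a project directory."""
    root = Path(project_dir)
    return {
        subsystem: {rel: (root / rel).exists() for rel in paths}
        for subsystem, paths in ARTIFACTS.items()
    }


def bottleneck(scores):
    """Return the lowest-scoring subsystem; ties go to the one listed first."""
    return min(scores, key=scores.get)


# Scores copied from the sample scorecard above.
scores = {
    "Instructions": 4,
    "State": 2,
    "Verification": 3,
    "Scope": 3,
    "Lifecycle": 2,
}
name = bottleneck(scores)
print(f"Bottleneck: {name} ({scores[name]}/5)")  # prints "Bottleneck: State (2/5)"
```

With the sample scores, State and Lifecycle tie at 2/5; resolving the tie by listing order reproduces the `State` bottleneck shown in the scorecard, though the real skill may use a different rule.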

Details

Author: AnastasiyaW
Repository: AnastasiyaW/claude-code-config
Created: 2 months ago
Last Updated: 2 days ago
Language: Python
License: MIT

Similar Skills

Semantically similar based on skill content — not just same category

Code & Development Listed

harness-audit

Scans a project for agent-harness gaps and outputs a prioritised list with paste-ready config snippets to close them. Covers hooks, CI, pre-commit, instruction file quality, init.sh, and session discipline. Use when starting a new project, auditing an existing harness, writing or reviewing CLAUDE.md/AGENTS.md, setting up CI gates, beginning multi-day autonomous work, or when user says "set up harness", "health check", "agent discipline", "project setup", "pre-commit hooks", "stop hook", "verification gap".

0 Updated 4 days ago
ClydeShen
AI & Automation Listed

harness-check

Agent-harness/framework structure auditor for multi-agent systems orchestrated via prompts, markdown, and scripts. ALWAYS invoke this skill when the user asks to 审查框架 (review the framework) / harness 体检 (harness health check) / 框架可靠性 (framework reliability) / 框架有没有问题 (does the framework have problems) / 检查我的框架 (check my framework) / 框架需要提升的点 (what needs improving in the framework) / 大模型能不能稳定触发 (can the LLM trigger it reliably) / review my agent framework / harness review / audit my agent orchestration. Do not audit an agent harness directly; use this skill first. Does not review business logic or security/secret leaks.

0 Updated 4 days ago
Zane456
Code & Development Listed

harness-guide

Review the current project state and continuously guide the project toward better Harness and Compound Engineering practices.

Identify:
- behaviors that align with best practices
- anti-patterns or weak workflows
- missing operational discipline, evals, harnesses, or feedback loops

Then recommend the next highest-leverage step, including:
- what to do next
- why it matters now
- what commands, tools, or workflows to use

Ask questions one at a time. For each question, provide your recommended answer. If information can be discovered from the codebase, inspect the codebase instead of asking the user.

0 Updated 4 days ago
ClydeShen