utility
SolidScore candidate agent actions by expected gain, cost, uncertainty, and redundancy to guide dispatch and termination decisions.
Install
Quality Score: 96/100
Skill Content
Details
- Author
- athola
- Repository
- athola/claude-night-market
- Created
- 6 months ago
- Last Updated
- today
- Language
- Python
- License
- MIT
Similar Skills
Semantically similar based on skill content — not just same category
tool-use-eval
Evaluation pattern for agents that call tools / functions - measures tool selection accuracy, argument correctness, recovery from tool errors, and avoidance of unnecessary tool calls. Pairs with eval-suite-planner. Use when: tool-use eval, function-calling eval, agent tool selection, tool invocation accuracy, tool argument validation, tool error recovery, agent tools, function-calling agent, MCP eval.
agent-evaluation
Evaluate LLM agents and tool-using workflows—task success, tool accuracy, latency/cost, safety, and regression suites. Use when shipping agent features, comparing prompts/models, or debugging agent failures.
agent-harness-construction
Design and optimize AI agent action spaces, tool definitions, and observation formatting for higher completion rates.