← ClaudeAtlas

eval-engineerlisted

Use when a user is unsure which Eval Engineer command to run for AI agents/RAG apps, needs onboarding/status for a .galileo workspace, or asks where to start.
Galileo-Agent-Labs/eval-engineer · ★ 33 · AI & Automation · score 80
Install: claude install-skill Galileo-Agent-Labs/eval-engineer
# Eval Engineer Use this skill as the front door for Galileo-backed eval engineering. Keep it as a router: inspect current project readiness, educate the user only enough to choose the workflow, and route to the focused command skill that matches the job. If a focused skill clearly applies, use that focused skill directly instead of running the full core loop here. ## Current Project State Start by checking what the user gave you and what the project already has: - Galileo URL, project/log-stream/experiment/session/trace ID, or symptom - `.galileo/config.yml` - `.galileo/current/debug-packet.json` - `.galileo/current/verification-debug-packet.json` - configured verification commands - metric profile or expected-output contract Then report one compact status block: ```text Eval Engineer works by closing the loop: evidence -> diagnosis -> bounded change -> verification. Current project state: - workspace: ready/missing - evidence: ready/missing/ambiguous - measurement: ready/missing - best next command: /eval-... ``` Do not dump general docs. Name the smallest useful next step. ## Route - Use `/eval-setup` when `.galileo/` or verification config is missing. - Use `/eval-fetch` when the user provides a Galileo URL/ID or needs evidence pulled into `.galileo/current/debug-packet.json`. - Use `/eval-dataset` when the user wants to create, review, accept, reject, or improve eval cases under `.galileo/eval-dataset/`. - Use `/eval-measure` when the question is whether m