benchmarking-analyst

Solid

Benchmarking analysis skill for performance comparison and best practice identification.

AI & Automation 1,160 stars 71 forks Updated today MIT

Install

View on GitHub

Quality Score: 96/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# benchmarking-analyst You are **benchmarking-analyst** - a specialized skill for benchmarking analysis including performance comparison and best practice identification. ## Overview This skill enables AI-powered benchmarking including: - Internal benchmarking - Competitive benchmarking - Functional benchmarking - Generic/best-in-class benchmarking - Gap analysis - Best practice identification - Adaptation planning - Performance tracking ## Capabilities ### 1. Benchmarking Project Setup ```python from dataclasses import dataclass from typing import List, Dict, Optional from datetime import datetime from enum import Enum class BenchmarkType(Enum): INTERNAL = "internal" # Compare within organization COMPETITIVE = "competitive" # Compare with competitors FUNCTIONAL = "functional" # Compare similar functions across industries GENERIC = "generic" # Compare with best-in-class anywhere @dataclass class BenchmarkProject: title: str benchmark_type: BenchmarkType process_area: str metrics: List[str] partners: List[str] owner: str def setup_benchmark_project(project: BenchmarkProject): """ Set up benchmarking project structure """ phases = { "1_planning": { "status": "in_progress", "tasks": [ {"task": "Identify what to benchmark", "status": "complete"}, {"task": "Identify benchmark partners", "status": "complete"}, {"task": "Determine data ...

Details

Author
a5c-ai
Repository
a5c-ai/babysitter
Created
4 months ago
Last Updated
today
Language
JavaScript
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Solid

benchmark-suite-creator

Create benchmark suite creator operations. Auto-activating skill for Performance Testing. Triggers on: benchmark suite creator, benchmark suite creator Part of the Performance Testing skill category. Use when working with benchmark suite creator functionality. Trigger with phrases like "benchmark suite creator", "benchmark creator", "benchmark".

2,274 Updated today
jeremylongshore
AI & Automation Solid

performance-benchmark-suite

SDK performance benchmarking and regression detection

1,160 Updated today
a5c-ai
Data & Documents Solid

benchmark-report-creator

Use PROACTIVELY when creating research reports, experiment writeups, technical whitepapers, or empirical study documentation. Orchestrates the complete benchmark report pipeline with structure, diagrams, hi-res PNG capture, and PDF export. Provides working scripts, CSS templates, and complete command sequences for publication-quality AI/ML benchmark reports. Not for slides, blog posts, or simple README files.

364 Updated today
majiayu000
AI & Automation Solid

benchmark

Use this skill to measure performance baselines, detect regressions before/after PRs, and compare stack alternatives.

201,447 Updated yesterday
affaan-m
AI & Automation Listed

skill-benchmarking

Run skill benchmarks with discriminating-only assertions against evals.json for any model and any AI agent. Use when benchmarking a skill against a model not yet tested, running with_skill/without_skill eval pairs, producing benchmark-<model>.json, re-grading an existing run, adding Phase 2 model comparison results, reviewing results in the eval viewer, updating README benchmark tables, or cleaning non-discriminating assertions from evals.json. Enforces strict grader isolation (the context that generates responses never grades them) and evidence-only passing (assertions pass only on explicit content, never on implication or charity). Works with Claude Code, Gemini CLI, GitHub Copilot, Cursor, and any other AI coding assistant.

1 Updated today
christim427-rgb