rubric-design-validation

Solid

Develop clear scoring rubrics with defined criteria, performance levels, and anchor examples ensuring inter-rater reliability

AI & Automation 1,160 stars 71 forks Updated today MIT

Install

View on GitHub

Quality Score: 96/100

Stars 20%

100

Recency 20%

100

Frontmatter 20%

Documentation 15%

100

Issue Health 10%

License 10%

100

Description 5%

100

Skill Content

# Rubric Design and Validation Develop clear scoring rubrics with defined criteria, performance levels, and anchor examples ensuring inter-rater reliability. ## Overview This skill enables the development and validation of scoring rubrics for educational assessment. It encompasses criteria definition, performance level articulation, anchor example selection, and reliability validation to ensure consistent and fair evaluation of student work. ## Capabilities ### Criteria Development - Identify essential performance dimensions - Define observable indicators - Weight criteria appropriately - Ensure comprehensiveness - Avoid overlap between criteria ### Performance Level Definition - Articulate distinct levels - Write clear descriptors - Ensure progressive differentiation - Define score points - Create level labels ### Anchor Examples - Select representative samples - Document exemplars for each level - Annotate scoring rationale - Create training materials - Validate with raters ### Reliability Validation - Conduct inter-rater reliability studies - Calculate agreement statistics - Identify scoring inconsistencies - Revise for clarity - Train and calibrate raters ## Usage Guidelines ### Rubric Development Process 1. Define purpose and use 2. Identify assessment criteria 3. Describe performance levels 4. Draft rubric descriptors 5. Select and annotate anchors 6. Validate with multiple raters 7. Revise based on feedback ### Descriptor Writing - Use concrete, observable l...

Details

Author: a5c-ai
Repository: a5c-ai/babysitter
Created: 4 months ago
Last Updated: today
Language: JavaScript
License: MIT

Similar Skills

Semantically similar based on skill content — not just same category

Code & Development Listed

content-evaluation-framework

This skill should be used when evaluating the quality of book chapters, lessons, or educational content. It provides a systematic 6-category rubric with weighted scoring (Technical Accuracy 30%, Pedagogical Effectiveness 25%, Writing Quality 20%, Structure & Organization 15%, AI-First Teaching 10%, Constitution Compliance Pass/Fail) and multi-tier assessment (Excellent/Good/Needs Work/Insufficient). Use this during iterative drafting, after content completion, on-demand review requests, or before validation phases.

335 Updated today

aiskillstore

AI & Automation Solid

assessment-item-development

Create valid, reliable assessment items across formats (multiple choice, constructed response, performance tasks) following psychometric best practices

1,160 Updated today

a5c-ai

AI & Automation Solid

evaluation-framework

Patterns for building evaluation and scoring systems, quality gates, rubrics, and decision frameworks. Use for any scored assessment.

297 Updated today

athola

AI & Automation Listed

advanced-evaluation

This skill should be used when the user asks to "implement LLM-as-judge", "compare model outputs", "create evaluation rubrics", "mitigate evaluation bias", or mentions direct scoring, pairwise comparison, position bias, evaluation pipelines, or automated quality assessment.

3 Updated today

Kalyanikhandare29

AI & Automation Listed

advanced-evaluation

335 Updated today

aiskillstore