data-quality-profiler

Solid

Profiles data assets to assess quality dimensions, detect anomalies, and generate comprehensive data quality reports with actionable recommendations.

Data & Documents 1,160 stars 71 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Data Quality Profiler Profiles data assets to assess quality dimensions and detect anomalies across the six core data quality dimensions. ## Overview This skill performs comprehensive data profiling to assess completeness, accuracy, consistency, validity, timeliness, and uniqueness. It generates statistical profiles, detects anomalies, identifies PII, and provides actionable recommendations for data quality improvement. ## Capabilities - **Statistical profiling** - Distributions, cardinality, null percentages, min/max values - **Data type inference and validation** - Detect actual vs declared types - **Pattern detection** - Regex patterns, formats, common structures - **Anomaly detection** - Outliers, drift, unexpected values - **Referential integrity checking** - Foreign key validation - **Freshness monitoring** - Data age and update frequency - **Volume trend analysis** - Record count patterns over time - **Schema change detection** - Structural changes between runs - **Cross-column correlation analysis** - Identify dependent columns - **PII detection and classification** - Sensitive data identification ## Input Schema ```json { "dataSource": { "type": "object", "required": true, "properties": { "type": { "type": "string", "enum": ["table", "file", "query"], "description": "Type of data source" }, "connection": { "type": "object", "description": "Connection details (platform, database, schema)"...

Details

Author
a5c-ai
Repository
a5c-ai/babysitter
Created
4 months ago
Last Updated
today
Language
JavaScript
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Listed

data-exploration

Profile and explore datasets to understand their shape, quality, and patterns before analysis. Use when encountering a new dataset, assessing data quality, discovering column distributions, identifying nulls and outliers, or deciding which dimensions to analyze.

1 Updated today
Safen99
Data & Documents Listed

profile-dataset

Produce a coverage & quality profile of a Narrative dataset (or access rule): row count, per-column null/fill rate, cardinality, ranges, top-values, inferred column shape, and quality flags. Reads bundled stats + sample first, recovers missing/stale stats by configuring and recalculating them, and escalates to a cheap `/write-nql` query only for a measure no stat can provide. Descriptive, not prescriptive. Use when: "profile dataset N", "what does dataset N look like", "coverage and quality of <dataset>", "what id types does N emit", "null rates / cardinality for <dataset>", "is this dataset's stats fresh". (narrative-common)

4 Updated 3 days ago
narrative-io
Data & Documents Solid

data-quality

Use this skill when implementing data validation, data quality monitoring, data lineage tracking, data contracts, or Great Expectations test suites. Triggers on schema validation, data profiling, freshness checks, row-count anomalies, column drift, expectation suites, contract testing between producers and consumers, lineage graphs, data observability, and any task requiring data integrity enforcement across pipelines.

167 Updated today
AbsolutelySkilled
Data & Documents Listed

data-quality

Use this skill when implementing data validation, data quality monitoring, data lineage tracking, data contracts, or Great Expectations test suites. Triggers on schema validation, data profiling, freshness checks, row-count anomalies, column drift, expectation suites, contract testing between producers and consumers, lineage graphs, data observability, and any task requiring data integrity enforcement across pipelines.

3 Updated today
Samuelca6399
AI & Automation Solid

data-quality-auditor

Audit datasets for completeness, consistency, accuracy, and validity. Profile data distributions, detect anomalies and outliers, surface structural issues, and produce an actionable remediation plan.

16,782 Updated 3 days ago
alirezarezvani