data-quality

Solid

Use this skill when implementing data validation, data quality monitoring, data lineage tracking, data contracts, or Great Expectations test suites. Triggers on schema validation, data profiling, freshness checks, row-count anomalies, column drift, expectation suites, contract testing between producers and consumers, lineage graphs, data observability, and any task requiring data integrity enforcement across pipelines.

Data & Documents 167 stars 29 forks Updated today MIT

Install

View on GitHub

Quality Score: 92/100

Stars 20%
74
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

When this skill is activated, always start your first response with the ๐Ÿงข emoji. # Data Quality Data quality is the practice of ensuring that data is accurate, complete, consistent, timely, and trustworthy as it flows through pipelines and systems. Without explicit quality gates, bad data propagates silently - corrupting dashboards, training flawed models, and breaking downstream consumers. This skill covers the five pillars: schema validation at ingress, expectation-based testing with Great Expectations, data contracts between producers and consumers, lineage tracking for impact analysis, and continuous monitoring for anomaly detection. --- ## When to use this skill Trigger this skill when the user: - Adds data validation or schema enforcement to a pipeline (ingestion, transformation, or serving) - Writes Great Expectations expectation suites or checkpoints - Defines data contracts between a producer team and consumer teams - Implements data lineage tracking or impact analysis - Sets up data quality monitoring dashboards or freshness/volume alerts - Investigates data quality incidents (missing columns, null spikes, schema drift) - Profiles a new dataset to understand distributions and anomalies - Builds row-count, freshness, or distribution-based quality checks Do NOT trigger this skill for: - General ETL/ELT pipeline orchestration (use an Airflow/dbt skill instead) - Data modeling or warehouse design decisions without a quality focus --- ## Key principles 1. **Val...

Details

Author
AbsolutelySkilled
Repository
AbsolutelySkilled/AbsolutelySkilled
Created
2 months ago
Last Updated
today
Language
MDX
License
MIT

Similar Skills

Semantically similar based on skill content โ€” not just same category

Data & Documents Listed

data-quality

Use this skill when implementing data validation, data quality monitoring, data lineage tracking, data contracts, or Great Expectations test suites. Triggers on schema validation, data profiling, freshness checks, row-count anomalies, column drift, expectation suites, contract testing between producers and consumers, lineage graphs, data observability, and any task requiring data integrity enforcement across pipelines.

3 Updated today
Samuelca6399
Data & Documents Solid

data-pipelines

Use this skill when building data pipelines, ETL/ELT workflows, or data transformation layers. Triggers on Airflow DAG design, dbt model creation, Spark job optimization, streaming vs batch architecture decisions, data ingestion, data quality checks, pipeline orchestration, incremental loads, CDC (change data capture), schema evolution, and data warehouse modeling. Acts as a senior data engineer advisor for building reliable, scalable data infrastructure.

167 Updated today
AbsolutelySkilled
Data & Documents Listed

data-pipelines

Use this skill when building data pipelines, ETL/ELT workflows, or data transformation layers. Triggers on Airflow DAG design, dbt model creation, Spark job optimization, streaming vs batch architecture decisions, data ingestion, data quality checks, pipeline orchestration, incremental loads, CDC (change data capture), schema evolution, and data warehouse modeling. Acts as a senior data engineer advisor for building reliable, scalable data infrastructure.

3 Updated today
Samuelca6399
Data & Documents Listed

data-quality-frameworks

Implement data quality validation with Great Expectations, dbt tests, and data contracts. Use when building data quality pipelines, implementing validation rules, or establishing data contracts.

335 Updated today
aiskillstore
Data & Documents Solid

data-quality-frameworks

Implement data quality validation with Great Expectations, dbt tests, and data contracts. Use when building data quality pipelines, implementing validation rules, or establishing data contracts.

39,350 Updated today
sickn33