data-science

Solid

Use this skill when performing exploratory data analysis, statistical testing, data visualization, or building predictive models. Triggers on EDA, pandas, matplotlib, seaborn, hypothesis testing, A/B test analysis, correlation, regression, feature engineering, and any task requiring data analysis or statistical inference.

Data & Documents 167 stars 29 forks Updated today MIT

Install

View on GitHub

Quality Score: 92/100

Stars 20%
74
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

When this skill is activated, always start your first response with the ๐Ÿงข emoji. # Data Science A practitioner's guide for exploratory data analysis, statistical inference, and predictive modeling. Covers the full analytical workflow - from raw data to reproducible conclusions - with an emphasis on *when* to apply each technique, not just *how*. Designed for engineers and analysts who can code but need opinionated guidance on statistical rigor and common traps. --- ## When to use this skill Trigger this skill when the user: - Loads a new dataset and wants to understand its structure and distributions - Needs to clean, reshape, or impute missing data in a pandas DataFrame - Runs a hypothesis test (t-test, chi-square, ANOVA, Mann-Whitney) - Analyzes an A/B test or experiment result for statistical significance - Builds a correlation matrix or investigates feature relationships - Plots distributions, trends, or model diagnostics with matplotlib or seaborn - Engineers features for a machine learning model - Fits a linear or logistic regression and needs to interpret coefficients - Calculates confidence intervals, p-values, or effect sizes - Needs to choose the right statistical test for their data type Do NOT trigger this skill for: - Deep learning / neural network architecture (use an ML engineering skill) - Data engineering pipelines, ETL, or streaming (use a data engineering skill) --- ## Key principles 1. **Visualize before modeling** - Plot every variable before fi...

Details

Author
AbsolutelySkilled
Repository
AbsolutelySkilled/AbsolutelySkilled
Created
2 months ago
Last Updated
today
Language
MDX
License
MIT

Similar Skills

Semantically similar based on skill content โ€” not just same category

Data & Documents Listed

data-science

Use this skill when performing exploratory data analysis, statistical testing, data visualization, or building predictive models. Triggers on EDA, pandas, matplotlib, seaborn, hypothesis testing, A/B test analysis, correlation, regression, feature engineering, and any task requiring data analysis or statistical inference.

3 Updated today
Samuelca6399
Data & Documents Featured

data-scientist

Expert data scientist for advanced analytics, machine learning, and statistical modeling. Handles complex data analysis, predictive modeling, and business intelligence.

27,705 Updated today
davila7
Data & Documents Solid

analytics-engineering

Use this skill when building dbt models, designing semantic layers, defining metrics, creating self-serve analytics, or structuring a data warehouse for analyst consumption. Triggers on dbt project setup, model layering (staging, intermediate, marts), ref() and source() usage, YAML schema definitions, metrics definitions, semantic layer configuration, dimensional modeling, slowly changing dimensions, data testing, and any task requiring analytics engineering best practices.

167 Updated today
AbsolutelySkilled
Data & Documents Listed

analytics-engineering

Use this skill when building dbt models, designing semantic layers, defining metrics, creating self-serve analytics, or structuring a data warehouse for analyst consumption. Triggers on dbt project setup, model layering (staging, intermediate, marts), ref() and source() usage, YAML schema definitions, metrics definitions, semantic layer configuration, dimensional modeling, slowly changing dimensions, data testing, and any task requiring analytics engineering best practices.

3 Updated today
Samuelca6399
Data & Documents Solid

data-pipelines

Use this skill when building data pipelines, ETL/ELT workflows, or data transformation layers. Triggers on Airflow DAG design, dbt model creation, Spark job optimization, streaming vs batch architecture decisions, data ingestion, data quality checks, pipeline orchestration, incremental loads, CDC (change data capture), schema evolution, and data warehouse modeling. Acts as a senior data engineer advisor for building reliable, scalable data infrastructure.

167 Updated today
AbsolutelySkilled