pandas-dataframe-analyzer

Solid

Automated DataFrame analysis skill for statistical summaries, missing value detection, data type inference, and memory optimization recommendations.

AI & Automation | 1,160 stars | 71 forks | Updated today | MIT

Quality Score: 96/100

Stars (20%): 100
Recency (20%): 100
Frontmatter (20%): 70
Documentation (15%): 100
Issue Health (10%): 50
License (10%): 100
Description (5%): 100

Skill Content

# pandas-dataframe-analyzer

## Overview

Automated DataFrame analysis skill for statistical summaries, missing value detection, data type inference, and memory optimization recommendations using pandas and profiling libraries.

## Capabilities

- Statistical profiling of DataFrames
- Missing value pattern detection
- Data type optimization suggestions
- Memory footprint analysis
- Duplicate detection and handling
- Distribution analysis and visualization
- Correlation matrix computation
- Cardinality analysis for categorical features

## Target Processes

- Exploratory Data Analysis (EDA) Pipeline
- Data Collection and Validation Pipeline
- Feature Engineering Design and Implementation

## Tools and Libraries

- pandas
- pandas-profiling / ydata-profiling
- numpy
- scipy (for statistical tests)

## Input Schema

```json
{
  "type": "object",
  "required": ["dataPath"],
  "properties": {
    "dataPath": {
      "type": "string",
      "description": "Path to the data file (CSV, Parquet, JSON)"
    },
    "sampleSize": {
      "type": "integer",
      "description": "Number of rows to sample for analysis",
      "default": 10000
    },
    "profileType": {
      "type": "string",
      "enum": ["minimal", "standard", "full"],
      "default": "standard"
    },
    "outputFormat": {
      "type": "string",
      "enum": ["json", "html", "markdown"],
      "default": "json"
    }
  }
}
```

## Output Schema

```json
{ "type": "object", "required": ["summary", "columns", "rec...
```
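A few of the capabilities listed in the skill content (missing value detection, duplicate detection, cardinality, memory footprint, and data type suggestions) can be illustrated with plain pandas. The following is a minimal sketch, not the skill's actual implementation: the function names `profile_dataframe` and `suggest_dtypes`, the `max_category_ratio` threshold, and the sample data are all assumptions made for illustration. The report uses only the `summary` and `columns` keys that are visible in the output schema; the truncated third field is left out rather than guessed.

```python
import numpy as np
import pandas as pd


def profile_dataframe(df: pd.DataFrame, sample_size: int = 10000) -> dict:
    """Hypothetical helper: per-column and overall statistics for a DataFrame."""
    # Sample only when the frame is larger than sample_size
    # (loosely mirrors the sampleSize input, default 10000).
    if len(df) > sample_size:
        df = df.sample(n=sample_size, random_state=0)

    columns = {}
    for name in df.columns:
        col = df[name]
        columns[name] = {
            "dtype": str(col.dtype),
            "missing": int(col.isna().sum()),
            "unique": int(col.nunique(dropna=True)),
            # deep=True also counts the Python objects behind object columns
            "memory_bytes": int(col.memory_usage(index=False, deep=True)),
        }

    return {
        "summary": {
            "rows": int(len(df)),
            "duplicate_rows": int(df.duplicated().sum()),
            "memory_bytes": int(df.memory_usage(index=True, deep=True).sum()),
        },
        "columns": columns,
    }


def suggest_dtypes(df: pd.DataFrame, max_category_ratio: float = 0.5) -> dict:
    """Hypothetical helper: propose smaller dtypes (heuristic, assumed threshold)."""
    suggestions = {}
    for name in df.columns:
        col = df[name]
        if pd.api.types.is_integer_dtype(col.dtype):
            smaller = pd.to_numeric(col, downcast="integer").dtype
            if smaller != col.dtype:
                suggestions[name] = str(smaller)
        elif pd.api.types.is_float_dtype(col.dtype):
            smaller = pd.to_numeric(col, downcast="float").dtype
            if smaller != col.dtype:
                suggestions[name] = str(smaller)
        elif pd.api.types.is_object_dtype(col.dtype) or isinstance(col.dtype, pd.StringDtype):
            # Low-cardinality text columns are usually cheaper as 'category'.
            if len(col) > 0 and col.nunique() / len(col) <= max_category_ratio:
                suggestions[name] = "category"
    return suggestions


# Illustrative data only (not taken from the skill).
df = pd.DataFrame({
    "id": [1, 2, 2, 4],
    "city": ["Oslo", "Bergen", "Bergen", None],
    "score": [np.nan, 3.0, 3.0, 2.5],
})
report = profile_dataframe(df)
suggestions = suggest_dtypes(df)
print(report["summary"])
print(suggestions)
```

The category heuristic here is deliberately simple: an object column can hold non-string values, so a real profiler would likely inspect the values before recommending a conversion.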

Details

Author: a5c-ai
Repository: a5c-ai/babysitter
Created: 4 months ago
Last Updated: today
Language: JavaScript
License: MIT

Similar Skills

Semantically similar based on skill content, not just the same category

AI & Automation Listed

pandas-pro

Use when working with pandas DataFrames, data cleaning, aggregation, merging, or time series analysis. Invoke for data manipulation, missing value handling, groupby operations, or performance optimization.

2 Updated today
zacklecon
Data & Documents Solid

pandas-pro

Performs pandas DataFrame operations for data analysis, manipulation, and transformation. Use when working with pandas DataFrames, data cleaning, aggregation, merging, or time series analysis. Invoke for data manipulation tasks such as joining DataFrames on multiple keys, pivoting tables, resampling time series, handling NaN values with interpolation or forward-fill, groupby aggregations, type conversion, or performance optimization of large datasets.

9,537 Updated 1 week ago
Jeffallan
Data & Documents Solid

data-quality-profiler

Profiles data assets to assess quality dimensions, detect anomalies, and generate comprehensive data quality reports with actionable recommendations.

1,160 Updated today
a5c-ai
AI & Automation Listed

data-exploration

Profile and explore datasets to understand their shape, quality, and patterns before analysis. Use when encountering a new dataset, assessing data quality, discovering column distributions, identifying nulls and outliers, or deciding which dimensions to analyze.

1 Updated today
Safen99
Data & Documents Listed

data-analysis

Data analysis assistant: reads data files, performs statistical analysis, and generates charts and reports.

0 Updated today
hugo57100