when-debugging-ml-training-use-ml-training-debugger
SolidDebug ML training issues and optimize performance including loss divergence, overfitting, and slow convergence
Install
Quality Score: 85/100
Skill Content
Details
- Author
- aiskillstore
- Repository
- aiskillstore/marketplace
- Created
- 5 months ago
- Last Updated
- today
- Language
- Python
- License
- None
Similar Skills
Semantically similar based on skill content — not just same category
ml-training-debugger
Diagnose machine learning training failures including loss divergence, mode collapse, gradient issues, architecture problems, and optimization failures. This skill spawns a specialist ML debugging ...
ml-model-training
Train ML models with scikit-learn, PyTorch, TensorFlow. Use for classification/regression, neural networks, hyperparameter tuning, or encountering overfitting, underfitting, convergence issues.
when-developing-ml-models-use-ml-expert
Specialized ML model development, training, and deployment workflow
ml-antipattern-validator
Prevents 30+ critical AI/ML mistakes including data leakage, evaluation errors, training pitfalls, and deployment issues. Use when working with ML training, testing, model evaluation, or deployment.
paper-train
Use this skill when the user wants to configure training parameters, set hyperparameters, debug training issues, or analyze training results. Triggers include: "training config", "hyperparameters", "learning rate", "batch size", "training parameters", "training failed", "loss is NaN", "OOM error", "training debugging". Also use when evaluating trained models or generating result tables and figures.