computer-vision

Solid

Use this skill when building computer vision applications, implementing image classification, object detection, or segmentation pipelines. Triggers on image classification, object detection, YOLO, semantic segmentation, image preprocessing, data augmentation, transfer learning, CNN architectures, vision transformers, and any task requiring visual recognition or image analysis.

Data & Documents 167 stars 29 forks Updated today MIT

Install

View on GitHub

Quality Score: 92/100

Stars 20%

Recency 20%

100

Frontmatter 20%

Documentation 15%

100

Issue Health 10%

License 10%

100

Description 5%

100

Skill Content

When this skill is activated, always start your first response with the 🧢 emoji. # Computer Vision Computer vision enables machines to interpret and reason about visual data - images, video, and multi-modal inputs. Modern CV pipelines are built on deep neural networks pretrained on large datasets (ImageNet, COCO, ADE20K) and fine-tuned for specific domains. PyTorch and its ecosystem (torchvision, timm, ultralytics, albumentations) cover the full stack from data loading through deployment. Foundation models like SAM, DINOv2, and OpenCLIP have shifted best practice toward prompt-based and zero-shot approaches before committing to full training runs. --- ## When to use this skill Trigger this skill when the user: - Trains or fine-tunes an image classifier on a custom dataset - Runs inference with YOLO, DETR, or other detection models - Builds a semantic or instance segmentation pipeline - Implements data augmentation for CV training - Preprocesses images for model ingestion (resize, normalize, batch) - Exports a vision model to ONNX or optimizes with TensorRT - Evaluates a vision model (mAP, confusion matrix, per-class metrics) - Implements a U-Net, DeepLabV3, or similar segmentation architecture Do NOT trigger this skill for: - Pure NLP tasks with no visual component (use a language-model skill instead) - 3D point-cloud processing or LiDAR-only pipelines (overlap is limited; check domain) --- ## Key principles 1. **Start with pretrained models** - Fine-tune ImageNet/C...

Details

Author: AbsolutelySkilled
Repository: AbsolutelySkilled/AbsolutelySkilled
Created: 2 months ago
Last Updated: today
Language: MDX
License: MIT

Similar Skills

Semantically similar based on skill content — not just same category

Data & Documents Listed

computer-vision

3 Updated today

Samuelca6399

AI & Automation Solid

senior-computer-vision

Computer vision engineering skill for object detection, image segmentation, and visual AI systems. Covers CNN and Vision Transformer architectures, YOLO/Faster R-CNN/DETR detection, Mask R-CNN/SAM segmentation, and production deployment with ONNX/TensorRT. Includes PyTorch, torchvision, Ultralytics, Detectron2, and MMDetection frameworks. Use when building detection pipelines, training custom models, optimizing inference, or deploying vision systems.

16,782 Updated 3 days ago

alirezarezvani

AI & Automation Solid

processing-computer-vision-tasks

This skill enables Claude to process and analyze images using computer vision techniques. It's used to perform tasks such as object detection, image classification, and image segmentation. Use this skill when a user requests analysis of an image, asks for identification of objects within an image, or needs help with other computer vision related tasks. Trigger terms include "analyze image", "object detection", "image classification", "image segmentation", "computer vision", "process image", or when the user provides an image and asks for insights.

2,274 Updated today

jeremylongshore

AI & Automation Solid

computer-vision-skill

Specialized skill for robot vision including feature detection, tracking, and camera calibration

1,160 Updated today

a5c-ai

AI & Automation Solid

senior-computer-vision

World-class computer vision skill for image/video processing, object detection, segmentation, and visual AI systems. Expertise in PyTorch, OpenCV, YOLO, SAM, diffusion models, and vision transformers. Includes 3D vision, video analysis, real-time processing, and production deployment. Use when building vision AI systems, implementing object detection, training custom vision models, or optimizing inference pipelines.

27,705 Updated today

davila7