computer-vision-skill

Solid

Specialized skill for robot vision including feature detection, tracking, and camera calibration

AI & Automation 1,160 stars 71 forks Updated today MIT

Install

View on GitHub

Quality Score: 97/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
64
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Computer Vision Skill ## Overview Expert skill for robot vision applications including camera calibration, feature detection and tracking, stereo vision, and visual servoing. ## Capabilities - Implement camera intrinsic calibration (pinhole, fisheye) - Configure stereo camera calibration and rectification - Set up camera-LiDAR extrinsic calibration - Implement feature detection (ORB, SIFT, SURF, SuperPoint) - Configure optical flow tracking (Lucas-Kanade, Farneback) - Implement depth estimation from stereo - Set up visual servoing pipelines - Configure image undistortion and rectification - Implement ArUco/AprilTag marker detection - Set up hand-eye calibration ## Target Processes - robot-calibration.js - visual-slam-implementation.js - object-detection-pipeline.js - digital-twin-development.js ## Dependencies - OpenCV - cv_bridge - image_geometry - camera_calibration ## Usage Context This skill is invoked when processes require camera calibration, feature detection, visual tracking, or image processing for robot vision applications. ## Output Artifacts - Camera calibration files (YAML) - Stereo calibration parameters - Feature detection configurations - Visual servoing controllers - Image processing pipelines - Marker detection configurations

Details

Author
a5c-ai
Repository
a5c-ai/babysitter
Created
4 months ago
Last Updated
today
Language
JavaScript
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Solid

calibration-tools-skill

Sensor and robot calibration using specialized robotics calibration tools

1,160 Updated today
a5c-ai
Data & Documents Solid

computer-vision

Use this skill when building computer vision applications, implementing image classification, object detection, or segmentation pipelines. Triggers on image classification, object detection, YOLO, semantic segmentation, image preprocessing, data augmentation, transfer learning, CNN architectures, vision transformers, and any task requiring visual recognition or image analysis.

167 Updated today
AbsolutelySkilled
Data & Documents Listed

computer-vision

Use this skill when building computer vision applications, implementing image classification, object detection, or segmentation pipelines. Triggers on image classification, object detection, YOLO, semantic segmentation, image preprocessing, data augmentation, transfer learning, CNN architectures, vision transformers, and any task requiring visual recognition or image analysis.

3 Updated today
Samuelca6399
AI & Automation Solid

senior-computer-vision

Computer vision engineering skill for object detection, image segmentation, and visual AI systems. Covers CNN and Vision Transformer architectures, YOLO/Faster R-CNN/DETR detection, Mask R-CNN/SAM segmentation, and production deployment with ONNX/TensorRT. Includes PyTorch, torchvision, Ultralytics, Detectron2, and MMDetection frameworks. Use when building detection pipelines, training custom models, optimizing inference, or deploying vision systems.

16,782 Updated 3 days ago
alirezarezvani
AI & Automation Solid

robot-perception

Comprehensive best practices for robot perception systems covering cameras, LiDARs, depth sensors, IMUs, and multi-sensor setups. Use this skill when working with RGB image processing, depth maps, point clouds, sensor calibration (intrinsic, extrinsic, hand-eye), object detection, semantic segmentation, 3D reconstruction, visual servoing, or perception pipeline optimization. Trigger whenever the user mentions OpenCV, Open3D, PCL, RealSense, ZED, OAK-D, camera calibration, AprilTags, ArUco markers, stereo vision, RGBD, point cloud filtering, ICP registration, coordinate transforms, camera intrinsics, distortion correction, image undistortion, sensor streaming, frame synchronization, or any computer vision task in a robotics context. Also covers multi-camera rigs, time synchronization across sensors, perception latency budgets, and production deployment of perception pipelines.

251 Updated 6 days ago
arpitg1304