edge-deployment-skill

Solid

ML model optimization and deployment on robot edge devices (Jetson, embedded)

AI & Automation 1,160 stars 71 forks Updated today MIT

Install

View on GitHub

Quality Score: 97/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
58
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Edge Deployment Skill ## Overview Expert skill for optimizing and deploying machine learning models on robot edge devices including NVIDIA Jetson and embedded systems. ## Capabilities - Configure TensorRT optimization for NVIDIA Jetson - Set up ONNX model conversion and optimization - Implement INT8 and FP16 quantization - Configure DeepStream for video analytics - Set up CUDA graph optimization - Implement model pruning and distillation - Configure DLA (Deep Learning Accelerator) deployment - Set up multi-stream inference - Implement ROS2 inference nodes - Profile and benchmark on target hardware ## Target Processes - nn-model-optimization.js - object-detection-pipeline.js - rl-robot-control.js - field-testing-validation.js ## Dependencies - TensorRT - ONNX Runtime - NVIDIA Jetson SDK - DeepStream ## Usage Context This skill is invoked when processes require deploying ML models on edge devices with optimized inference performance. ## Output Artifacts - TensorRT engine files - ONNX optimized models - Quantization configurations - DeepStream pipeline configs - Inference benchmark reports - ROS2 inference node implementations

Details

Author
a5c-ai
Repository
a5c-ai/babysitter
Created
4 months ago
Last Updated
today
Language
JavaScript
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Solid

object-detectionsegmentation-skill

Deep learning based object detection and segmentation for robotics applications

1,160 Updated today
a5c-ai
AI & Automation Listed

datarobot-model-deployment

Tools and guidance for deploying DataRobot models, managing deployments, configuring prediction environments, and deployment operations. Use when deploying models, creating or updating deployments, or configuring prediction environments.

16 Updated 2 days ago
datarobot-oss
AI & Automation Solid

deploying-machine-learning-models

This skill enables Claude to deploy machine learning models to production environments. It automates the deployment workflow, implements best practices for serving models, optimizes performance, and handles potential errors. Use this skill when the user requests to deploy a model, serve a model via an API, or put a trained model into a production environment. The skill is triggered by requests containing terms like "deploy model," "productionize model," "serve model," or "model deployment."

2,274 Updated today
jeremylongshore
AI & Automation Solid

optimizing-deep-learning-models

This skill optimizes deep learning models using various techniques. It is triggered when the user requests improvements to model performance, such as increasing accuracy, reducing training time, or minimizing resource consumption. The skill leverages advanced optimization algorithms like Adam, SGD, and learning rate scheduling. It analyzes the existing model architecture, training data, and performance metrics to identify areas for enhancement. The skill then automatically applies appropriate optimization strategies and generates optimized code. Use this skill when the user mentions "optimize deep learning model", "improve model accuracy", "reduce training time", or "optimize learning rate".

2,274 Updated today
jeremylongshore
AI & Automation Listed

embedded-ai-deployment

Deploy AI models to embedded hardware using MathWorks tools (MATLAB, Simulink, Embedded Coder). Covers two workflow patterns: (1) MathWorks-native or 3P-imported models rebuilt as dlnetwork for lean hardware (Cortex-M, DSP), (2) direct C/C++ code generation from PyTorch and LiteRT models for high-performance hardware (Cortex-A, x86, GPU). Trigger when: user wants to deploy AI to embedded targets; generate C/CUDA from neural networks; compress AI models for MCU/DSP; integrate AI in Simulink for system-level simulation; import PyTorch/ONNX/TensorFlow models for embedded deployment; optimize AI for resource-constrained hardware; or use loadPyTorchExportedProgram, importNetworkFromPyTorch, dlquantizer, exportNetworkToSimulink, or Embedded Coder with AI models.

117 Updated 2 days ago
matlab