cuda-debugging

Solid

Expert skill for GPU debugging using CUDA-GDB and NVIDIA Compute Sanitizer. Detect memory errors, race conditions, uninitialized memory access, validate atomic operations, analyze kernel synchronization issues, and generate debugging reports with recommendations.

AI & Automation 1,160 stars 71 forks Updated today MIT

Install

View on GitHub

Quality Score: 96/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# cuda-debugging You are **cuda-debugging** - a specialized skill for GPU debugging and error detection using NVIDIA's Compute Sanitizer and CUDA-GDB tools. This skill provides expert capabilities for identifying and resolving correctness issues in CUDA programs. ## Overview This skill enables AI-powered GPU debugging operations including: - Executing compute-sanitizer memory checks (memcheck) - Detecting race conditions with racecheck tool - Identifying memory leaks and invalid accesses - Using CUDA-GDB for kernel debugging - Analyzing kernel synchronization issues - Validating atomic operation correctness - Detecting uninitialized memory access (initcheck) - Generating debugging reports with actionable recommendations ## Prerequisites - NVIDIA CUDA Toolkit 11.0+ with compute-sanitizer - CUDA-GDB for interactive debugging - GPU with debugging support (compute capability 3.5+) - Debug build of CUDA application (-G -lineinfo flags) - Optional: Nsight Visual Studio Code Extension ## Capabilities ### 1. Memory Error Detection (Memcheck) Detect memory access errors and leaks: ```bash # Basic memory check compute-sanitizer --tool memcheck ./cuda_program # With detailed error reporting compute-sanitizer --tool memcheck --report-api-errors all ./cuda_program # Log errors to file compute-sanitizer --tool memcheck --log-file memcheck.log ./cuda_program # Check for memory leaks compute-sanitizer --tool memcheck --leak-check full ./cuda_program # Track allocations compute-s...

Details

Author
a5c-ai
Repository
a5c-ai/babysitter
Created
4 months ago
Last Updated
today
Language
JavaScript
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Solid

cuda-toolkit

Deep integration with NVIDIA CUDA toolkit for kernel development, compilation, and debugging. Execute nvcc compilation with optimization flags analysis, generate and validate CUDA kernel code, analyze PTX/SASS assembly output, and configure execution parameters.

1,160 Updated today
a5c-ai
AI & Automation Solid

gpu-memory-analysis

Specialized skill for GPU memory hierarchy analysis and optimization. Analyze memory access patterns, detect bank conflicts, optimize cache utilization, profile global memory bandwidth, and generate optimized memory access code patterns.

1,160 Updated today
a5c-ai
AI & Automation Solid

gpu-benchmarking

Expert skill for automated GPU performance benchmarking and regression detection. Design micro-benchmarks, measure kernel execution time with CUDA events, calculate achieved vs theoretical performance, generate comparison reports, detect regressions in CI/CD, and profile power/thermal characteristics.

1,160 Updated today
a5c-ai
AI & Automation Solid

unified-memory

Expert skill for CUDA Unified Memory and memory prefetching optimization. Configure managed memory allocations, implement memory prefetch strategies, handle page fault analysis, configure memory hints and advise, profile unified memory migration, optimize for oversubscription scenarios, and compare managed vs explicit memory.

1,160 Updated today
a5c-ai
AI & Automation Solid

memory-analysis

Embedded memory analysis, optimization, and leak detection

1,160 Updated today
a5c-ai