Analyze CUDA kernel performance using NVIDIA Nsight Compute (NCU). Use when the user wants to: (1) profile a CUDA kernel with NCU to generate .ncu-rep files, (2) analyze NCU reports programmatically using the ncu_report Python API, (3) extract specific metrics from NCU profiles (duration, throughput, occupancy, cache hit rates), (4) compare two kernel profiles side-by-side, (5) identify performance bottlenecks in GPU kernels, or (6) work with .ncu-rep files in any way.
Key Features
- Comprehensive skill evaluation and performance tracking
- Community-driven ratings and reviews
- Easy integration with Claude Code
- Regular updates and maintenance
Quick Start
TopRank Skills install meta-pytorch/ncu-analysis
chat Comments (0)
Sign in to join the discussion and leave a comment.
Skill Details
GitHub Stars
68
GitHub Forks
30
Created
Mar 2026
Last Updated
3 months ago
tools
tools debugging
Related Skills
Build your own?
Join 12,000+ developers contributing to the Claude ecosystem.
No comments yet. Be the first to share your thoughts!