Overview
- Skill Key: danbennettuk/voice-note-to-midi
- Author: Clawd
- Source Repo: openclaw/skills
- Version: -
- Source Path: skills/danbennettuk/voice-note-to-midi
- Latest Commit SHA: bbf3d30f44e0c2a31796a729c0b418456922d82f
Convert voice notes, humming, and melodic audio recordings to quantized MIDI files using ML-based pitch detection and intelligent post-processing.
- Stars: 0
- Installs: 0
- Status: ACTIVE
- Visibility: PUBLIC
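Copy the prompt below and send it to your AI assistant to complete the installation.
First check whether the SkillHub store is installed. If it is not, install it by following https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md, then install the voice-note-to-midi skill. If it is already installed, install the voice-note-to-midi skill directly.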
# 🎵 Voice Note to MIDI
Transform your voice memos, humming, and melodic recordings into clean, quantized MIDI files ready for your DAW.
## What It Does
This skill provides a complete audio-to-MIDI conversion pipeline that:
1. **Stem Separation** - Uses HPSS (Harmonic-Percussive Source Separation) to isolate melodic content from drums, noise, and background sounds
2. **ML-Powered Pitch Detection** - Leverages Spotify's Basic Pitch model for accurate fundamental frequency extraction
3. **Key Detection** - Automatically detects the musical key of your recording using Krumhansl-Kessler key profiles
4. **Intelligent Quantization** - Snaps notes to a configurable timing grid with optional key-aware pitch correction
5. **Post-Processing** - Applies octave pruning, overlap-based harmonic removal, and legato note merging for clean output
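The key detection step can be illustrated with a small, dependency-free sketch. It assumes the usual Krumhansl-Kessler approach: build a duration-weighted pitch-class histogram and correlate it against the major and minor profiles rotated to each of the 12 tonics. The `detect_key` and `pearson` names and the `(midi_pitch, duration)` input format are hypothetical and are not taken from the skill's source; the skill itself may weight or normalize notes differently.

```python
import math

# Krumhansl-Kessler probe-tone profiles; index 0 is the tonic.
KK_MAJOR = [6.35, 2.23, 3.48, 2.33, 4.38, 4.09, 2.52, 5.19, 2.39, 3.66, 2.29, 2.88]
KK_MINOR = [6.33, 2.68, 3.52, 5.38, 2.60, 3.53, 2.54, 4.75, 3.98, 2.69, 3.34, 3.17]
NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    norm_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    norm_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (norm_x * norm_y) if norm_x and norm_y else 0.0


def detect_key(notes):
    """Estimate the key from (midi_pitch, duration_seconds) pairs (hypothetical input format)."""
    histogram = [0.0] * 12
    for pitch, duration in notes:
        histogram[pitch % 12] += duration  # weight each pitch class by how long it sounds

    best_score, best_key = None, None
    for tonic in range(12):
        for mode, profile in (("major", KK_MAJOR), ("minor", KK_MINOR)):
            # Rotate the profile so that its tonic weight lines up with `tonic`.
            rotated = [profile[(pc - tonic) % 12] for pc in range(12)]
            score = pearson(histogram, rotated)
            if best_score is None or score > best_score:
                best_score, best_key = score, f"{NOTE_NAMES[tonic]} {mode}"
    return best_key
```

Short or highly chromatic recordings give a flat histogram, so the relative major and minor can score almost the same. Treat the result as a hint for pitch correction rather than a certainty.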
### Pipeline Architecture
```
Audio Input (WAV/M4A/MP3)
↓
┌─────────────────────────────────────┐
│ Step 1: Stem Separation (HPSS) │
│ - Isolate harmonic content │
│ - Remove drums/percussion │
│ - Noise gating │
└─────────────────────────────────────┘
↓
┌─────────────────────────────────────┐
│ Step 2: Pitch Detection │
│ - Basic Pitch ML model (Spotify) │
│ - Polyphonic note detection │
│ - Onset/offset estimation │
└─────────────────────────────────────┘
↓
┌─────────────────────────────────────┐
│ Step 3: Analysis │
│ - Pitch class distribution │
│ - Key detection │
│ - Dominant note identification │
└─────────────────────────────────────┘
↓
┌─────────────────────────────────────┐
│ Step 4: Quantization & Cleanup │
│ - Timing grid snap │
│ - Key-aware pitch correction │
│ - Octave pruning (harmonic removal) │
│ - Overlap-based pruning │
│ - Note merging (legato) │
│ - Velocity normalization │
└────────────────────────────────...
```
# 🎵 Voice Note to MIDI
Convert voice memos, humming, and melodic recordings to clean, quantized MIDI files.
[License: MIT](https://opensource.org/licenses/MIT)
Transform your 3 AM melody ideas into MIDI notes for your DAW — no keyboard required.
## What It Does
```
Voice Note (WAV/M4A/MP3)
↓
┌─────────────────────────────────────┐
│ 1. Stem Separation (HPSS) │
│ Isolate melody from noise │
└─────────────────────────────────────┘
↓
┌─────────────────────────────────────┐
│ 2. Basic Pitch ML (Spotify) │
│ Detect fundamental frequencies │
└─────────────────────────────────────┘
↓
┌─────────────────────────────────────┐
│ 3. Key Detection │
│ Identify musical key │
└─────────────────────────────────────┘
↓
┌─────────────────────────────────────┐
│ 4. Quantization & Cleanup │
│ • Snap to timing grid │
│ • Key-aware pitch correction │
│ • Harmonic pruning (octave/overlap)│
│ • Note merging (legato) │
└─────────────────────────────────────┘
↓
MIDI File → Your DAW
```
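Step 4 of the diagram names harmonic pruning and legato merging without spelling them out. The sketch below shows one plausible reading, under stated assumptions: a note is treated as an overtone if it sits one or two octaves above another note and overlaps it for most of its own duration, and same-pitch fragments separated by a tiny gap are fused into one sustained note. The `Note` type, the function names, and the `min_overlap` and `max_gap` thresholds are hypothetical, not the skill's actual implementation.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Note:
    pitch: int    # MIDI note number
    start: float  # seconds
    end: float    # seconds


def prune_octave_harmonics(notes, min_overlap=0.5):
    """Drop notes that look like octave overtones of a lower, overlapping note."""
    kept = []
    for note in notes:
        duration = note.end - note.start
        is_harmonic = False
        for other in notes:
            # Only consider notes exactly one or two octaves below this one.
            if note.pitch - other.pitch not in (12, 24):
                continue
            overlap = min(note.end, other.end) - max(note.start, other.start)
            if duration > 0 and overlap / duration >= min_overlap:
                is_harmonic = True
                break
        if not is_harmonic:
            kept.append(note)
    return kept


def merge_legato(notes, max_gap=0.05):
    """Fuse consecutive same-pitch notes whose gap is at most max_gap seconds."""
    merged = []
    for note in sorted(notes, key=lambda n: (n.pitch, n.start)):
        prev = merged[-1] if merged else None
        if prev and prev.pitch == note.pitch and note.start - prev.end <= max_gap:
            merged[-1] = replace(prev, end=max(prev.end, note.end))
        else:
            merged.append(note)
    # Return the notes in playback order.
    return sorted(merged, key=lambda n: (n.start, n.pitch))
```

For example, a hummed C4 detected as two fragments plus a phantom C5 would collapse to a single C4 after `merge_legato(prune_octave_harmonics(notes))`. Pruning before merging keeps short overtone fragments from being judged against an already lengthened fundamental.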
## Quick Start
```bash
git clone https://github.com/DanBennettUK/voice-note-to-midi.git
cd voice-note-to-midi
./setup.sh
```
Then convert a voice memo:
```bash
hum2midi my_humming.wav # Creates my_humming.mid
hum2midi voice.wav song.mid # Custom output name
hum2midi hum.wav --key-aware # Auto-detect key & quantize
```
## Features
- **ML-Powered Pitch Detection** — Spotify's Basic Pitch model
- **Key Detection** — Automatic musical key identification
- **Key-Aware Quantization** — Snap notes to detected scale
- **Harmonic Pruning** — Remove overtones, keep fundamentals
- **Legato Merging** — Combine note chunks into sustained tones
- **Configurable Grid** — 1/4, 1/8, 1/16, or 1/32 note quantization
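To make the grid and key-aware features concrete, here is a minimal sketch of both snaps. It assumes a known tempo in BPM, a 4/4-style grid where a quarter note is one beat, and plain major or natural-minor scales. The function names, the `bpm` and `division` parameters, and the tie-breaking rule (an out-of-scale pitch moves down before up) are illustrative assumptions, not the tool's documented behavior.

```python
MAJOR_SCALE = {0, 2, 4, 5, 7, 9, 11}
MINOR_SCALE = {0, 2, 3, 5, 7, 8, 10}  # natural minor


def snap_to_grid(time_s, bpm, division=16):
    """Snap a time in seconds to the nearest 1/division-note grid line.

    division is 4, 8, 16, or 32; a quarter note (division=4) lasts one beat.
    """
    step = (60.0 / bpm) * (4.0 / division)  # grid spacing in seconds
    return round(time_s / step) * step


def snap_pitch_to_scale(pitch, tonic, mode="major"):
    """Move a MIDI pitch to the nearest pitch in the given key.

    tonic is a pitch class (0 = C ... 11 = B). In-scale pitches are unchanged;
    out-of-scale pitches try one semitone down, then one semitone up.
    """
    scale = MAJOR_SCALE if mode == "major" else MINOR_SCALE
    for offset in (0, -1, 1, -2, 2):
        candidate = pitch + offset
        if (candidate - tonic) % 12 in scale:
            return candidate
    return pitch  # unreachable for these 7-note scales, kept as a safe fallback
```

At 120 BPM a 1/16 grid has lines every 0.125 s, so an onset at 0.30 s snaps to 0.25 s. A finer grid preserves more of the original phrasing, while a coarser one cleans up timing more aggressively.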
## Usage
```bash
hum2midi <input.wav> [output.mid] [options]
Options:
--grid <value> Qu...
```