HPSS Technology: The Science of Making AI Music Sound Human
Harmonic-Percussive Source Separation (HPSS) is a signal processing technique that separates audio into two components: harmonic sounds (like vocals and sustained instruments) and percussive sounds (like drums and rhythmic elements).
Why HPSS Works for AI Detection Bypass
AI music detectors primarily analyze the background texture of a track — the noise floor, spectral patterns, and frequency distribution of non-vocal elements. HPSS allows us to:
- Isolate the vocal layer — Keep it completely untouched (harmonic component)
- Process only the background — Apply encoding that modifies spectral patterns AI detectors look for (percussive component)
- Recombine with optimized mix ratio — 1.6:0.4 (vocal:background) for natural listening
vitqa's Optimized v2 Parameters
- vocal_ratio: 0.24 — 24% vocal presence in mix
- background bitrate: 48k — Background layer encoded at 48kbps
- output bitrate: 128k — Final output at 128kbps CBR
- mix_ratio: 1.6:0.4 — Natural-sounding blend of processed layers
The Result
12.5% average AI probability across 20+ test tracks — the lowest achieved by any method tested, while maintaining full audio quality.