I built a VAD that beats Silero, Pyannote, and WebRTC on noisy audio — here's how (opens in new tab)
I built NOVA-VAD — a lightweight, explainable Voice Activity Detector that beats every major open source VAD on real-world noisy audio. GitHub:( Benchmark (100 held-out files, never seen during training) Model Accuracy Lightweight Explainable WebRTC VAD 58.0% ✅ ❌ Pyannote VAD 62.0% ❌ ❌ Silero VAD 87.0% ❌ ❌ NOVA-VAD 93.0% ✅ ✅ No PyTorch or GPU required — pure scikit-learn Explains every decision with confidence scores and feature importance Built-in denoiser pipeline Retrainable on your own da...
Read the original article