O-EENC-SD: Efficient Online End-to-End Neural Clustering for Speaker Diarization
arxiv.org·1d

Abstract: We introduce O-EENC-SD, an end-to-end online speaker diarization system based on EEND-EDA that uses a novel RNN-based stitching mechanism for online prediction. In particular, we develop a novel centroid refinement decoder, whose usefulness is assessed through a rigorous ablation study. Our system offers two key advantages over existing methods: it is hyperparameter-free, unlike unsupervised clustering approaches, and it is more efficient than current online end-to-end methods, which are computationally costly. We demonstrate that O-EENC-SD is competitive with the state of the art in the two-speaker conversational telephone speech domain, as tested on the CallHome dataset. Our results show that O-EENC-SD provides a g…
