Drax: Speech Recognition with Discrete Flow Matching
huggingface.co·7h·
Discuss: Hacker News
Flag this post

Published on Oct 5

Authors:

,

,

,

,

,

Abstract

Drax, a discrete flow matching framework for ASR, achieves state-of-the-art recognition accuracy with improved efficiency by constructing an audio-conditioned probability path.

AI-generated summary

Diffusion and flow-based non-autoregressive (NAR) models have shown strong promise in large language modeling, however, their potential for automatic speech recognition (ASR) remains largely unexplored. We propose Drax, a discrete flow matching framework for ASR that enables efficient parallel decoding. To better align training with inference, we construct an [audio-conditioned probability pa…

Similar Posts

Loading similar posts...