GitHub - openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision
github.com·5h
🔤Tokenization
Preview
Report Post

Whisper

[Blog] [Paper] [Model card] [Colab example]

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.

Approach

A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented a…

Similar Posts

Loading similar posts...