Audio ML

Speech Recognition, Music Information Retrieval, Acoustic Modeling, Sound Classification


Towards Robust Arabic Speech Emotion Recognition with Deep Learning

 🌊Digital Signal Processing  Content type: Academic
arxiv.org

Your fingers deserve a break — Voibe dictation lifetime access is on sale for $50

 🎧Vorbis Encoding
macworld.com

EvilIrving/ai-transcriber: Transcribe and summarize videos, podcasts, local files, and RSS feeds using Whisper and OpenAI-compatible LLMs.

 🎙️Whisper  Content type: Code
github.com · r/WebApps

Build a local voice agent with Red Hat OpenShift AI

 🎙️Whisper
developers.redhat.com

OpenCode Plugin by Aito's Intelligence

 🐚Shell Automation

Subtitle-Aligned Fine-Tuning of Whisper for Swiss German ASR: Benchmark Contamination, Convention Mismatch, and an Honest Baseline at 25.6% WER (13.8% cWER)

 🎙️Whisper  Content type: Academic
arxiv.org

LocalClicky - Control your Mac locally by voice

 💻Local LLMs  Content type: Code
github.com

GNSS-FM: A Self-Supervised Foundation Model for Daily GNSS Displacement Time Series

 🌀Riemannian Computing  Content type: Academic
arxiv.org

ibrahimqureshae/whisperx-transcriber: Offline AI transcription for Windows. Word-level timestamps. No cloud. No subscription. Free forever.

 🎙️Whisper  Content type: Code
github.com · r/editors
