Speech Recognition


Towards Truly Multilingual ASR: Generalizing Code-Switching ASR to Unseen Language Pairs

 🤖Transformers  Content type: Academic
arxiv.org

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

 🤖Transformers  Content type: Blog
huggingface.co

DW News : DW : June 11, 2026 4:00am-4:02am CEST

 🎛️Audio DSP
archive.org

Evaluate Clinical ASR Models Faster with Agent Skills and NVIDIA Nemotron Speech

 🤖Transformers  Content type: News  Content type: Blog

Treble Technologies and Hugging Face Address Benchmark of Automatic Speech Recognition Models

 🤖Machine Learning
audioxpress.com

lbj96347/nemotron-3.5-asr-ios: On-device, offline speech recognition for iPhone/iPad using NVIDIA's Nemotron-3.5-ASR Streaming 0.6B (multilingual) via CoreML. SwiftUI app with mic capture + audio file import, RNN-T decoding, and live benchmark metrics (latency, RTF, memory).

 🤖Machine Learning  Content type: Code
github.com · Hacker News

What TTS Throws Away

 🤖Transformers
amaldavid.com · Hacker News

Pico-Driven Ultrasound Enables Scaled Acoustic Model of Home Stereo

 🎛️Audio DSP
hackaday.com

AI Week in Review 26.06.06

 🌟Ray Tracing  Content type: News  Content type: Blog

Evaluating Bias in Phoneme-Based Automatic Speech Recognition Systems: An Analysis of IPA Transcription Models

 🤖Transformers  Content type: Academic
arxiv.org

Palabra.ai Review 2026: Real-Time Speech Translation, Tested Carefully

 🤖Transformers  Content type: Blog
medium.com

DW News : DW : June 8, 2026 9:00pm-9:03pm CEST

 🎛️Audio DSP
archive.org

Tight Boundary Prediction in Speaker Diarization Using Causal-Anticausal Consistency

 🤖Transformers  Content type: Academic
arxiv.org

DW News : DW : June 10, 2026 9:00pm-9:02pm CEST : Free Borrow & Streaming

 🎛️Audio DSP  Content type: Video
archive.org

rccyx/asryx: Daemonless Linux native ASR binary (embedded via whisper.cpp C API, no dependencies beyond the standard C++ and Linux toolchain)

 🤖Machine Learning  Content type: Code
github.com · Hacker News

Enhancing Multilingual LLM-based ASR with Mixture of Experts and Dynamic Downsampling

 🤖Transformers  Content type: Academic
arxiv.org

ALJAZ : June 11, 2026 4:30am-5:00am AST : Free Borrow & Streaming

 📚Compilers  Content type: Video
archive.org

Speaker Group Encoding in Self-supervised Speech Recognition Models

 🤖Transformers  Content type: Academic
arxiv.org

News : RT : June 10, 2026 12:00pm-12:31pm EDT : Free Borrow & Streaming

 🤖AI  Content type: Video
archive.org

tetherto/qvac: QVAC - Local AI SDK and libraries for building private, cross-platform, peer-to-peer AI applications. Run LLMs, speech-to-text, translation, and more locally on Linux, macOS, Windows, Android, and iOS.

 🔍RAG  Content type: Code
github.com
