Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Speech Recognition
🗣️ Speech Recognition
ASR, Acoustic Models, Phoneme Recognition, Voice Processing
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
259
posts in
18.7
ms
Towards Truly Multilingual
ASR
: Generalizing Code-Switching
ASR
to Unseen Language Pairs
🤖
Transformers
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Towards Truly Multilingual ASR: Generalizing Code-Switching ASR to Unseen Language Pairs
Can
Voice
Agents Handle Bilingual Customers? Benchmarking Frontier
ASR
on Code-Switched
Speech
🤖
Transformers
Content type:
Blog
huggingface.co
·
1d
1 day ago
Actions for Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech
DW News : DW : June 11, 2026 4:00am-4:02am CEST
🎛️
Audio DSP
archive.org
·
14h
14 hours ago
Actions for DW News : DW : June 11, 2026 4:00am-4:02am CEST
Evaluate Clinical
ASR
Models
Faster with Agent Skills and NVIDIA Nemotron
Speech
🤖
Transformers
Content type:
News
Content type:
Blog
developer.nvidia.com
·
2d
2 days ago
Actions for Evaluate Clinical ASR Models Faster with Agent Skills and NVIDIA Nemotron Speech
Treble Technologies and Hugging Face Address Benchmark of
Automatic
Speech
Recognition
Models
🤖
Machine Learning
audioxpress.com
·
6d
6 days ago
Actions for Treble Technologies and Hugging Face Address Benchmark of Automatic Speech Recognition Models
lbj96347/nemotron-3.5-asr-ios
: On-device, offline
speech
recognition
for iPhone/iPad using NVIDIA's
Nemotron-3.5-ASR
Streaming 0.6B (multilingual) via CoreML.SwiftUI app with mic capture + audio file import, RNN-Tdecoding, and live benchmark metrics (latency, RTF, memory).
🤖
Machine Learning
Content type:
Code
github.com
·
1d
1 day ago
·
Hacker News
Actions for lbj96347/nemotron-3.5-asr-ios: On-device, offline speech recognition for iPhone/iPad using NVIDIA's Nemotron-3.5-ASR Streaming 0.6B (multilingual) via CoreML.SwiftUI app with mic capture + audio file import, RNN-Tdecoding, and live benchmark metrics (latency, RTF, memory).
What TTS Throws Away
🤖
Transformers
amaldavid.com
·
5d
5 days ago
·
Hacker News
Actions for What TTS Throws Away
Pico-Driven Ultrasound Enables Scaled
Acoustic
Model
of Home Stereo
🎛️
Audio DSP
hackaday.com
·
2d
2 days ago
Actions for Pico-Driven Ultrasound Enables Scaled Acoustic Model of Home Stereo
AI Week in Review 26.06.06
🌟
Ray Tracing
Content type:
News
Content type:
Blog
patmcguinness.substack.com
·
4d
4 days ago
·
Substack
Actions for AI Week in Review 26.06.06
Evaluating Bias in
Phoneme-Based
Automatic
Speech
Recognition Systems: An Analysis of IPA Transcription Models
🤖
Transformers
Content type:
Academic
arxiv.org
·
12h
12 hours ago
Actions for Evaluating Bias in Phoneme-Based Automatic Speech Recognition Systems: An Analysis of IPA Transcription Models
Palabra.ai Review 2026: Real-Time
Speech
Translation, Tested Carefully
🤖
Transformers
Content type:
Blog
medium.com
·
6d
6 days ago
Actions for Palabra.ai Review 2026: Real-Time Speech Translation, Tested Carefully
DW News : DW : June 8, 2026 9:00pm-9:03pm CEST
🎛️
Audio DSP
archive.org
·
2d
2 days ago
Actions for DW News : DW : June 8, 2026 9:00pm-9:03pm CEST
Tight Boundary Prediction in
Speaker
Diarization
Using Causal-Anticausal Consistency
🤖
Transformers
Content type:
Academic
arxiv.org
·
12h
12 hours ago
Actions for Tight Boundary Prediction in Speaker Diarization Using Causal-Anticausal Consistency
DW News : DW : June 10, 2026 9:00pm-9:02pm CEST : Free Borrow & Streaming
🎛️
Audio DSP
Content type:
Video
archive.org
·
21h
21 hours ago
Actions for DW News : DW : June 10, 2026 9:00pm-9:02pm CEST : Free Borrow & Streaming
tetherto/qvac: QVAC - Local AI SDK and libraries for building private, cross-platform, peer-to-peer AI applications. Run LLMs,
speech-to-text
, translation, and more locally on Linux, macOS, Windows, Android, and iOS.
🔍
RAG
Content type:
Code
github.com
·
6d
6 days ago
Actions for tetherto/qvac: QVAC - Local AI SDK and libraries for building private, cross-platform, peer-to-peer AI applications. Run LLMs, speech-to-text, translation, and more locally on Linux, macOS, Windows, Android, and iOS.
Enhancing Multilingual LLM-based
ASR
with Mixture of Experts and Dynamic Downsampling
🤖
Transformers
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Enhancing Multilingual LLM-based ASR with Mixture of Experts and Dynamic Downsampling
ALJAZ : June 11, 2026 4:30am-5:00am AST : Free Borrow & Streaming
📚
Compilers
Content type:
Video
archive.org
·
14h
14 hours ago
Actions for ALJAZ : June 11, 2026 4:30am-5:00am AST : Free Borrow & Streaming
Speaker
Group Encoding in Self-supervised
Speech
Recognition
Models
🤖
Transformers
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Speaker Group Encoding in Self-supervised Speech Recognition Models
Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language
Models
🎯
Escape Analysis
Content type:
Academic
arxiv.org
·
12h
12 hours ago
Actions for Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language Models
ibrahimqureshae/whisperx-transcriber
: Offline AI transcription for Windows.
Word-level
timestamps. No cloud. No subscription. Free forever.
🌟
Ray Tracing
Content type:
Code
github.com
·
1d
1 day ago
·
r/editors
Actions for ibrahimqureshae/whisperx-transcriber: Offline AI transcription for Windows. Word-level timestamps. No cloud. No subscription. Free forever.
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help