🎙️ Speech AI
speech to speech, TTS, ASR, voice synthesis, Whisper
Scoured 405 posts in 7.5 ms
Palabra.ai Review 2026: Real-Time Speech Translation, Tested Carefully
🔮 Multimodal AI · Content type: Blog · medium.com · 5 days ago
You don't need Copilot for code completion, try this instead
🔮 Multimodal AI · mistral.ai · 2 days ago · r/GithubCopilot
Azure Speech at Build 2026: Powering Voice Agents with Real-Time and Life-like Experiences
🧠 LLM Research · techcommunity.microsoft.com · 6 days ago
fix(doctor): keep TTS legacy migration on supported paths (#91787) · openclaw/openclaw@c0a4a78
🦀 Rust · Content type: Code · github.com · 13 hours ago
Americans lost nearly $900 million to AI-powered scams, FBI says
🔮 Multimodal AI · Content type: Blog · malwarebytes.com · 1 day ago
The Most Emotive Foundation Models for Voice
🧠 LLM Research · Content type: Blog · misolabs.ai · 6 days ago · Hacker News
dcm31/self-improving-podcast
🎯 Reinforcement Learning · val.town · 1 day ago · Hacker News
Higgs Audio v3 TTS 4B. Built for voice chat. Support 100 languages and inline control.
🔮 Multimodal AI · huggingface.co · 5 days ago · Hacker News, r/LocalLLaMA
Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders
🔮 Multimodal AI · Content type: Academic · arxiv.org · 9 hours ago
Severe Mispronunciation Stays Under 3% in English.
🧠 LLM Research · Content type: Blog · medium.com · 6 days ago
New comment by motyar in "Ask HN: Who wants to be hired? (June 2026)"
🔧 Backend Dev · Content type: Discussion · news.ycombinator.com · 5 days ago · Hacker News
lbj96347/nemotron-3.5-asr-ios: On-device, offline speech recognition for iPhone/iPad using NVIDIA's Nemotron-3.5-ASR Streaming 0.6B (multilingual) via CoreML. SwiftUI app with mic capture + audio file import, RNN-T decoding, and live benchmark metrics (latency, RTF, memory).
🔩 ML Compilers · Content type: Code · github.com · 9 hours ago · Hacker News
TLDR: Compressing Audio Tokens for Efficient Autoregressive Text-to-Speech
🧠 LLM Research · Content type: Academic · arxiv.org · 1 day ago
A Realistic Faceless YouTube Shorts Workflow for Busy Makers
🔮 Multimodal AI · koinster7.gumroad.com · 5 days ago · DEV
Wiring the ElevenLabs API into a real pipeline: the SDK is 4 lines, the billing isn't
🤖 AI Engineering · Content type: Discussion · aialleyway.com · 5 days ago · DEV
DW News : DW : June 8, 2026 9:00pm-9:03pm CEST
🧠 LLM Research · archive.org · 1 day ago
AI Week in Review 26.06.06
🔮 Multimodal AI · Content type: News, Blog · patmcguinness.substack.com · 3 days ago · Substack
Critical Analysis of TTS Models via VoiceArena
🧠 LLM Research · Content type: Blog · medium.com · 6 days ago
OpenBibleTTS: Large-Scale Speech Resources and TTS Models for Low-Resource Languages
🧠 LLM Research · Content type: Academic · arxiv.org · 1 day ago
New comment by tjsawyer in "Ask HN: Who wants to be hired? (June 2026)"
🧠 LLM Research · Content type: Discussion · news.ycombinator.com · 6 days ago · Hacker News