Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🗣️ Speech Synthesis
Neural TTS, Voice Cloning, Real-time Audio, Kitten TTS
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
154209
posts in
14.9
ms
4 Open-Source
TTS
Models That Can
Clone
Voices and Actually Sound Human
🎚️
Voice AI Systems
firethering.com
·
5d
·
Hacker News
A Guide to Voice
Cloning
on
Voxtral
with a Missing Encoder
🎚️
Voice AI Systems
towardsdatascience.com
·
20h
Show HN: I
bootstrapped
a
foundational
text-to-speech model from scratch
🎙️
Whisper
tontaube.ai
·
2d
·
Hacker News
Towards Real-Time Human-AI Musical Co-Performance:
Accompaniment
Generation with Latent Diffusion Models and
MAX/MSP
🎚️
Voice AI Systems
arxiv.org
·
1d
Saganaki22/ComfyUI-OmniVoice-TTS
:
OmniVoice
TTS nodes for ComfyUI - Zero-shot multilingual text-to-speech with voice cloning, voice design, and multi-speaker dialogue
🎙️
Whisper
github.com
·
5d
Microsoft
Rolls
Out New Speech Models in Push Toward First-Party AI
Stack
🎚️
Voice AI Systems
slator.com
·
2d
Omni
Voice: Free AI Voice Generator & Voice
Cloning
🎚️
Voice AI Systems
omnivoice.app
·
3d
·
Hacker News
Free
Unlimited
Voice
Cloning
Forever
🎚️
Voice AI Systems
artlu.bearblog.dev
·
2d
Escaping the Fork: How Meta
Modernized
WebRTC
Across 50+ Use Cases
📹
WebRTC
engineering.fb.com
·
1d
Building real-time
conversational
podcasts with Amazon Nova 2
Sonic
🌊
Event Streaming
aws.amazon.com
·
3d
VibeVoice
: A
Frontier
Open-Source Text-to-Speech Model
🎤
Voice Interfaces
microsoft.github.io
·
6d
tronghieuit/tiny-tts
: The Smallest English
TTS
Model with only 1M parameters
🎚️
Voice AI Systems
github.com
·
2d
·
Hacker News
Expressive
Prompting: Improving Emotion
Intensity
and Speaker Consistency in Zero-Shot TTS
🎚️
Voice AI Systems
arxiv.org
·
5d
An
experiment
in voice text
editing
with Gemini Live
🎚️
Voice AI Systems
public.grugnotes.com
·
4d
·
Hacker News
Fine-tuning
Whisper
to my speech: 27% to 6.5%
WER
🎙️
Whisper
vivekkairi.com
·
5d
·
Hacker News
CapTalk
: Unified Voice Design for
Single-Utterance
and Dialogue Speech Generation
🎤
Voice Interfaces
arxiv.org
·
1d
TASU2
: Controllable
CTC
Simulation for Alignment and Low-Resource Adaptation of Speech LLMs
🎚️
Voice AI Systems
arxiv.org
·
1d
Brain-to-Speech:
Prosody
Feature Engineering and Transformer-Based
Reconstruction
🎚️
Voice AI Systems
arxiv.org
·
3d
OmniSonic
: Towards Universal and
Holistic
Audio Generation from Video and Text
🎤
Voice Interfaces
arxiv.org
·
4d
MMTalker
:
Multiresolution
3D Talking Head Synthesis with Multimodal Feature Fusion
🎚️
Voice AI Systems
arxiv.org
·
5d
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help