Speech AI

Feeds to Scour
SubscribedAll
Scoured 398 posts in 7.0 ms

Show HN: Every Claw Deserves a Face

 🤖Robotics
nyxclaw.ai··Hacker News

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

 🧠LLM Research  Content type: Blog

OpenBibleTTS: Large-Scale Speech Resources and TTS Models for Low-Resource Languages

 🧠LLM Research  Content type: Academic
arxiv.org·

1min.AI lifetime subscription (87% discount)

 🤖AI Engineering
sharewareonsale.com·

CODEANDTRUST/clawcall: Give your OpenClaw / self-hosted AI agent inbound phone calls - a Twilio-to-gateway voice bridge with working agent tools mid-call (MIT).

 🤖AI Engineering  Content type: Code
github.com··Hacker News

Async AI Review: Make & Edit Videos by Messaging an AI

 🔮Multimodal AI  Content type: Blog
medium.com·

BareWave: Waveform-Native Flow-Matching Text-to-Speech

 🔮Multimodal AI  Content type: Academic
arxiv.org·

The Phantom Syllable Dilemma: Decoupling Auto-Regressive Attention and Hallucinations in TTS

 🧠LLM Research  Content type: Blog
medium.com
·

Open Notebook’s AI-powered podcasts are a game-changer for productivity, provided you’re willing to configure them right

 🗄️Database Internals
xda-developers.com·

End-to-End Training for Discrete Token LLM based TTS System

 🧠LLM Research  Content type: Academic
arxiv.org·

fix(openai): require api-key auth for realtime voice (#91567) · openclaw/openclaw@9fdd56d

 🔧Backend Dev  Content type: Code
github.com·

KIT's Submission to Cross-Lingual Voice Cloning in IWSLT 2026

 🧠LLM Research  Content type: Academic
arxiv.org·

AuRA: Internalizing Audio Understanding into LLMs as LoRA

 🔮Multimodal AI  Content type: Academic
arxiv.org·

DW News : DW : June 7, 2026 11:00pm-11:03pm CEST

 🧠LLM Research
archive.org·

refactor(discord): distill reply hydration tests · openclaw/openclaw@c84e521

 🦀Rust  Content type: Code
github.com·

N\"ushuVoice: Reviving the Voice of Endangered N\"ushu with Pitch-Aware Text-to-Speech

 🧠LLM Research  Content type: Academic
arxiv.org·

dots.tts Technical Report

 🤖AI Engineering  Content type: Academic
arxiv.org·

docs: document tts runtime contracts · openclaw/openclaw@2f00fbf

 🔧Backend Dev  Content type: Code
github.com·

Optimality of FSQ Tokens for Continuous Diffusion for Categorical Data with Application to Text-to-Speech

 🧠LLM Research  Content type: Academic
arxiv.org·

DW News : DW : June 5, 2026 3:00am-3:02am CEST

 🖥️OS Development
archive.org·
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help