🗣️ Speech Synthesis - nmarshall · Scour

Bridging the Stability-Expressivity Gap: Synthetic Data Scaling and Preference Alignment for Low-Resource Spoken Language Models 🤖LLMs

Supertonic TTS Android: Another Offline AI Voice Engine Joins the Open-Source TTS Wave 🎚️Voice AI Systems

[Also from Our Team] TTSFree AI for Mac: Turn Your Downloaded Subtitles into Natural Speech 🎚️Voice AI Systems

poppop.ai·2d·r/SurFastDownloader

5uck1ess/tts-bench: Quick speed bench: for all types of TTSs on Windows/Mac. 🎚️Voice AI Systems

github.com·6d·r/LocalLLaMA

Building Real-Time Voice Agents from Scratch - Learning Roadmap 🎚️Voice AI Systems

nemorize.com·1d·Hacker News

Bjango releases Robot Vowels formant filter plugin with launch discount (5 FREE copies inside) 🎤Voice Interfaces

bedroomproducersblog.com·22h

TTS doesn't suck anymore 🎭Gradual Typing

duarteocarmo.com·4d

AI ‘voice cloning’ scams are on the rise. Here’s how to protect yourself 🎚️Voice AI Systems

gumieri/nenya: A lightweight, highly secure AI API Gateway/Proxy written in Go. Acts as transparent middleware between local AI coding clients (OpenCode/Pi/Cursor) and upstream LLM providers (Gemini, DeepSeek, Zhipu z.ai). 💫Apache Pulsar

ElevenLabs is bringing Stan Lee back from the dead with AI voice cloning and digital cameos 🤖Anthropic Claude

thenextweb.com·2d

Live trending GitHub repositories — daily momentum ranking 🤖AI Coding Tools

trendshift.io·4d·Hacker News

Stan Lee's voice and likeness have been resurrected, thanks to AI 🎚️Voice AI Systems

engadget.com·1d

How We Built Dynamic NPC Dialogue with LLMs 🎙️Whisper

vantage-digital.online·5d·DEV

Spectrograms vs. MFCCs: Practical Tradeoffs in Audio ML [video] 🎵Audio DSP

youtube.com·1d·Hacker News

supertone-inc/supertonic: Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX. 🎙️Whisper

github.com·6d·r/ObsidianMD

ElevenLabs: Modern Slavery Policy Statement 🔧Right to Repair

elevenlabs.io·2d·Hacker News

Reachy Mini goes fully local 🎙️Whisper

huggingface.co·1d·Hacker News

Anthropic to expand Claude Voice Mode to more languages (2 minute read) 🤖Anthropic Claude

testingcatalog.com·3d

MELD: Mel-Spectrogram-Based Speech Language Modeling with Discrete Latent Variables 🎚️Voice AI Systems

Calibre 9.8 E-Book Manager Improves Content Server, Native TTS Engine, and More 💬Language Servers

linuxtoday.com·4d

Log in to enable infinite scrolling