Top 5 Text-to-Speech Open Source Models
kdnuggets.com·1d
🎚️Voice AI Systems
Flag this post
Emergent introspective awareness in large language models
transformer-circuits.pub·5h·
Discuss: Hacker News
🏠Self-hosted AI
Flag this post
Artificial Neural Networks Trained on Noisy Speech Exhibit the McGurk Effect
arxiv.org·1d
🎚️Voice AI Systems
Flag this post
Linear Audio Dreams: Injecting Sanity into Autoencoder Latent Spaces by Arvind Sundararajan
dev.to·2d·
Discuss: DEV
🎤Voice Interfaces
Flag this post
Text to Speech Sam
texttospeechrobot.com·4d·
Discuss: Hacker News
🎚️Voice AI Systems
Flag this post
From Frustration to Creation: How a New Way Brought My Ideas to Life
videoasprompt.com·1d·
Discuss: Hacker News
🎚️Voice AI Systems
Flag this post
A Developer’s Guide to Real-Time Speech-to-Speech Translation for Mobile and VoIP Calls
future.forem.com·1d·
Discuss: DEV
🎚️Voice AI Systems
Flag this post
Writing an LLM from scratch, part 25 – instruction fine-tuning
gilesthomas.com·1d·
Discuss: Hacker News
🏗️AI Infrastructure
Flag this post
How to Build a Voice AI Agent Using Open-Source Tools
freecodecamp.org·3d·
🎙️Whisper
Flag this post
I made a 10¢ MCU Talk
atomic14.com·1d·
Discuss: Hacker News
🎚️Audio Codecs
Flag this post
Emergent Introspective Awareness in Large Language Models
lesswrong.com·1d
🏠Self-hosted AI
Flag this post
How to Apply Powerful AI Audio Models to Real-World Applications
towardsdatascience.com·3d
🎤Voice Interfaces
Flag this post
I Built a Podcast Script Prompt That Actually Works—Here's the Complete Template
dev.to·20h·
Discuss: DEV
vibe-coding
Flag this post
Toward Machine Interpreting: Lessons from Human Interpreting Studies
machinelearning.apple.com·2d
🎚️Voice AI Systems
Flag this post
5 must know open-source repositories to build cool AI apps
dev.to·2d·
Discuss: DEV
🎚️Voice AI Systems
Flag this post
Veo3 vs. Wan2.2 vs. Sora2: Zero-Shot Video Generation Comparison
nuefunnel.com·16h·
Discuss: Hacker News
🏗️AI Infrastructure
Flag this post
Everything About Transformers
krupadave.com·1d·
🏗️AI Infrastructure
Flag this post
Metis-SPECS: Decoupling Multimodal Learning via Self-distilled Preference-based Cold Start
arxiv.org·5h
🏗️AI Infrastructure
Flag this post
TheStageAI/TheWhisper: up to 3x faster optimized Whisper models for streaming and on-device use
github.com·1d·
🎙️Whisper
Flag this post
Challenges in Building Natural, Low‑Latency, Reliable Voice Assistants
hackernoon.com·1d
🎤Voice Interfaces
Flag this post