Top 5 Text-to-Speech Open Source Models
kdnuggets.com·1d
🎚️Voice AI Systems
Artificial Neural Networks Trained on Noisy Speech Exhibit the McGurk Effect
arxiv.org·1d
🎚️Voice AI Systems
Linear Audio Dreams: Injecting Sanity into Autoencoder Latent Spaces by Arvind Sundararajan
🎤Voice Interfaces
Text to Speech Sam
🎚️Voice AI Systems
From Frustration to Creation: How a New Way Brought My Ideas to Life
🎚️Voice AI Systems
A Developer’s Guide to Real-Time Speech-to-Speech Translation for Mobile and VoIP Calls
🎚️Voice AI Systems
I made a 10¢ MCU Talk
🎚️Audio Codecs
Emergent Introspective Awareness in Large Language Models
lesswrong.com·1d
🏠Self-hosted AI
How to Apply Powerful AI Audio Models to Real-World Applications
towardsdatascience.com·3d
🎤Voice Interfaces
I Built a Podcast Script Prompt That Actually Works—Here's the Complete Template
✨vibe-coding
Toward Machine Interpreting: Lessons from Human Interpreting Studies
machinelearning.apple.com·2d
🎚️Voice AI Systems
Everything About Transformers
🏗️AI Infrastructure
Metis-SPECS: Decoupling Multimodal Learning via Self-distilled Preference-based Cold Start
arxiv.org·5h
🏗️AI Infrastructure
TheStageAI/TheWhisper: up to 3x faster optimized Whisper models for streaming and on-device use
🎙️Whisper
Challenges in Building Natural, Low‑Latency, Reliable Voice Assistants
hackernoon.com·1d
🎤Voice Interfaces