🐿️ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🗣️ Speech Synthesis

Neural TTS, Voice Cloning, Real-time Audio, Kitten TTS

What do Speech Foundation Models Learn? Analysis and Applications
arxiv.org·13h
🎙️Whisper
Behind the Mic: Real-World Challenges of Voice AI
trata.ai·1d·
Discuss: Hacker News
🎚️Voice AI Systems
How to Make Generative AI in Python
dev.to·3h·
Discuss: DEV
🏗️AI Infrastructure
BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-Scale Pretraining
blog.datologyai.com·1d·
Discuss: Hacker News
🏗️AI Infrastructure
Launch HN: Uplift (YC S25) – Voice models for under-served languages
news.ycombinator.com·5h·
Discuss: Hacker News
🗣️Voice Coding
Language Models as Thespians
jstrieb.github.io·9h·
Discuss: Lobsters, Hacker News, r/programming
💻Local LLMs
An efficient path to production AI: Kakao’s journey with JAX and Cloud TPUs
cloud.google.com·1h
🏗️AI Infrastructure
Chinese Room vs. SupatMod Experiment 1/7 (Claude, Mar 12-13, 2025)
medium.com·5h·
Discuss: Hacker News
🎤Voice Interfaces
NVIDIA's Granary dataset is remarkable but it exposes a fundamental misunderstanding about production voice AI architecture.
dev.to·1d·
Discuss: DEV
🎚️Voice AI Systems
Synchronization and semantization in deep spiking networks
arxiv.org·13h
🧠Neuromorphic Chips
Show HN: Privacy first tone rewriter Chrome extension using local Gemini Nano
github.com·1d·
Discuss: Hacker News
🎙️Whisper
Rethinking Non-Negative Matrix Factorization with Implicit Neural Representations
machinelearning.apple.com·1d
💻Local LLMs
How to model the world? Introduction to Laplace Neuron
abibulic.github.io·3h·
Discuss: Hacker News
🧠Neuromorphic Chips
Dyslexia and AI: How I use AI to help me write better articles as a blogger with Dyslexia
homeworkinghenry.com·1d
⏱️productivity
The Hidden Costs of Coding With Generative AI - MIT Sloan Management Review
news.google.com·1d
🏗️AI Infrastructure
Transform Static Images Into Dynamic Videos With JoggAI’s Talking Photo
hackernoon.com·8h
🎤Voice Interfaces
DeCoT: Decomposing Complex Instructions for Enhanced Text-to-Image Generation with Large Language Models
arxiv.org·13h
🎙️Whisper
LoRAtorio: An intrinsic approach to LoRA Skill Composition
arxiv.org·1d
🤖AI agents
ADMIRE-BayesOpt: Accelerated Data MIxture RE-weighting for Language Models with Bayesian Optimization
arxiv.org·1d
🏗️AI Infrastructure
Natural Language → SQL with Reinforcement Fine Tuning (RFT)
docs.fireworks.ai·18h·
Discuss: Hacker News
🔍Query Compilers
Loading...Loading more...
AboutBlogChangelogRoadmap