speech-to-text, model, local, whisper, whisper.cpp, input, voice, recognition, ai

Show HN: Nanowakeword – Automates custom wake word model training
github.com·7h·
Discuss: Hacker News
🗣️Speech Synthesis
Probing Whisper for Dysarthric Speech in Detection and Assessment
arxiv.org·3d
🗣️Voice Coding
Show HN: AI Voice AudioBook – Convert ebooks to audio with your cloned voice
zan.chat·6h·
Discuss: Hacker News
🗣️Speech Synthesis
English - the hottest programming language of the future
dev.to·1h·
Discuss: DEV
🧩Low-code
The key to conversational speech recognition
datasciencecentral.com·1d
🎤Voice Interfaces
Show HN: I built a local AI agent desk toy
blog.simone.computer·1d·
Discuss: Hacker News
🤖AI agents
I'm an AI tools expert, and these are the 4 I pay for now (plus 2 I'm eyeing) - ZDNET
news.google.com·1d
🤖AI agents
Show HN: I built a video-to-text tool – 10 min free daily, no signup
harku.io·5h·
Discuss: Hacker News
🗣️Speech Synthesis
RND1: Simple, Scalable AR-to-Diffusion Conversion
radicalnumerics.ai·23h·
Discuss: Hacker News
🏗️AI Infrastructure
Harmonizing AI Voices: Bridging the Gap in Intelligent Communication
dev.to·2d·
Discuss: DEV
🎤Voice Interfaces
LoRA Explained: Faster, More Efficient Fine-Tuning with Docker
docker.com·1d
🏠Self-hosted AI
IASC: Interactive Agentic System for ConLangs
arxiv.org·15h
💬Language Servers
​​Speech-to-Retrieval (S2R): A new approach to voice search
research.google·3d·
Discuss: Hacker News
🎤Voice Interfaces
I built a translator for spatial thinking (because I can't interview in Python)
graemefawcett.ca·10m·
Discuss: Hacker News
vibe-coding
Can Speech LLMs Think while Listening?
arxiv.org·15h
🎤Voice Interfaces
Causality Guided Representation Learning for Cross-Style Hate Speech Detection
arxiv.org·15h
📱Edge AI
AI receptionist that answers real phone calls
news.ycombinator.com·1h·
Discuss: Hacker News
🧠AI
Vibe-Coding vs. AI-Assisted Development
adaptivealchemist.com·7h·
Discuss: Hacker News
🤖AI agents