speech-to-text, model, local, whisper, whisper.cpp, input, voice, recognition, ai

Show HN: Nanowakeword – Automates custom wake word model training
github.com·4h·
Discuss: Hacker News
🎵Audio ML
Probing Whisper for Dysarthric Speech in Detection and Assessment
arxiv.org·3d
🎵Audio ML
Show HN: AI Voice AudioBook – Convert ebooks to audio with your cloned voice
zan.chat·2h·
Discuss: Hacker News
🎧Learned Audio
From barks to words: Researchers aim to translate dog sounds with AI
phys.org·1d
🗣️CMU Pronouncing
The key to conversational speech recognition
datasciencecentral.com·21h
🎵Audio ML
Show HN: I built a local AI agent desk toy
blog.simone.computer·1d·
Discuss: Hacker News
📝Concrete Syntax
From Documents to Dialogue: A step-by-step RAG Journey
dev.to·1h·
Discuss: DEV
📊Multi-vector RAG
Show HN: I built a video-to-text tool – 10 min free daily, no signup
harku.io·2h·
Discuss: Hacker News
🎵Audio Streaming
Harmonizing AI Voices: Bridging the Gap in Intelligent Communication
dev.to·2d·
Discuss: DEV
🎧Learned Audio
RND1: Simple, Scalable AR-to-Diffusion Conversion
radicalnumerics.ai·20h·
Discuss: Hacker News
💻Local LLMs
Tool or Agent? The impact of AI in your code and in your wallet It all boils down to math again!
blog.codeminer42.com·1d
Proof Automation
IASC: Interactive Agentic System for ConLangs
arxiv.org·11h
🌳Context free grammars
LoRA Explained: Faster, More Efficient Fine-Tuning with Docker
docker.com·1d
💻Local LLMs
​​Speech-to-Retrieval (S2R): A new approach to voice search
research.google·3d·
Discuss: Hacker News
🗂️Vector Search
Can Speech LLMs Think while Listening?
arxiv.org·11h
🎵Audio ML
Causality Guided Representation Learning for Cross-Style Hate Speech Detection
arxiv.org·11h
🧮Vector Embeddings
10 Data + AI Observations for Fall 2025
towardsdatascience.com·1h
🌊Stream Processing
Vibe-Coding vs. AI-Assisted Development
adaptivealchemist.com·3h·
Discuss: Hacker News
Incremental Computation