speech-to-text, model, local, whisper, whisper.cpp, input, voice, recognition, ai

Show HN: Interview Transcription with AI Quote Extraction and Q&A Format
harku.io·1d·
Discuss: Hacker News
🎚️Voice AI Systems
Flag this post
Olsrt – OverLab Streams Runtime
news.ycombinator.com·2h·
Discuss: Hacker News
☁️Serverless Rust
Flag this post
Bringing Vision-Language Intelligence to RAG with ColPali
towardsdatascience.com·1d
🧩Low-code
Flag this post
10 AI Tools Every Business Needs in 2026
vibe.forem.com·5h·
Discuss: DEV
🧠AI
Flag this post
Why do AI models use so many em-dashes?
seangoedecke.com·1d·
vibe-coding
Flag this post
How Machine Learning Is Solving the $2 Trillion Contract Management Problem
dev.to·5h·
Discuss: DEV
🧠AI
Flag this post
Contribution-Guided Asymmetric Learning for Robust Multimodal Fusion under Imbalance and Noise
arxiv.org·13h
💻Local LLMs
Flag this post
EchoMind: An Interrelated Multi-level Benchmark for Evaluating Empathetic Speech Language Models
arxiv.org·3d
🎤Voice Interfaces
Flag this post
Arabic Little STT: Arabic Children Speech Recognition Dataset
arxiv.org·3d
🎚️Voice AI Systems
Flag this post
Challenges in Building Natural, Low‑Latency, Reliable Voice Assistants
hackernoon.com·1d
🎤Voice Interfaces
Flag this post
Testing Cross-Lingual Text Comprehension In LLMs Using Next Sentence Prediction
arxiv.org·1d
💻Local LLMs
Flag this post
Falcon: A Comprehensive Chinese Text-to-SQL Benchmark for Enterprise-Grade Evaluation
arxiv.org·1d
🔍Query Compilers
Flag this post
Efficient Generative AI Boosts Probabilistic Forecasting of Sudden Stratospheric Warmings
arxiv.org·13h
🏗️AI Infrastructure
Flag this post
Latent Refinement Decoding: Enhancing Diffusion-Based Language Models byRefining Belief States
dev.to·17h·
Discuss: DEV
🏗️AI Infrastructure
Flag this post
Unleashing Creativity: Exploring Top Generative AI Datasets for Multimodal Innovation
dev.to·1d·
Discuss: DEV
🏗️AI Infrastructure
Flag this post
Building a Self-Improving RAG System with Smart Query Routing and Answer Validation
dev.to·4h·
Discuss: DEV
🔍Query Compilers
Flag this post
Towards a Method for Synthetic Generation of PWA Transcripts
arxiv.org·1d
🗣️Speech Synthesis
Flag this post
Show HN: E2E Testing for Chatbots
github.com·2d·
Discuss: Hacker News
vibe-coding
Flag this post
Structurally Valid Log Generation using FSM-GFlowNets
arxiv.org·13h
🔍Query Compilers
Flag this post