speech-to-text, model, local, whisper, whisper.cpp, input, voice, recognition, ai

Don't Let It Fade: Preserving Edits in Diffusion Language Models via Token Timestep Allocation
arxiv.org·49m
💻Local LLMs
Flag this post
TheStageAI/TheWhisper: up to 3x faster optimized Whisper models for streaming and on-device use
github.com·1d·
🗣️Voice Coding
Flag this post
A Developer’s Guide to Real-Time Speech-to-Speech Translation for Mobile and VoIP Calls
future.forem.com·19h·
Discuss: DEV
🎚️Voice AI Systems
Flag this post
Vibe Coding Tools - Vibespecs CLI
dev.to·1h·
Discuss: DEV
vibe-coding
Flag this post
Vibe-Spec: Generate Specifications from Coding Agent Logs
marmelab.com·15h·
Discuss: Hacker News
vibe-coding
Flag this post
How to Build a Voice AI Agent Using Open-Source Tools
freecodecamp.org·3d·
🎚️Voice AI Systems
Flag this post
Opportunistically Parallel Lambda Calculus
dl.acm.org·6h·
Discuss: Hacker News
🔍Query Compilers
Flag this post
Abjad AI at NADI 2025: CATT-Whisper: Multimodal Diacritic Restoration Using Text and Speech Representations
arxiv.org·2d
🗣️Voice Coding
Flag this post
Show HN: Hot or Slop – Visual Turing test on how well humans detect AI images
hotorslop.com·1h·
Discuss: Hacker News
🏗️AI Infrastructure
Flag this post
Tencent/WeKnora
github.com·3h
☁️Serverless Rust
Flag this post
LoCoT2V-Bench: A Benchmark for Long-Form and Complex Text-to-Video Generation
arxiv.org·49m
🗣️Speech Synthesis
Flag this post
Top 5 Text-to-Speech Open Source Models
kdnuggets.com·1d
🗣️Speech Synthesis
Flag this post
Circle or highlight on any app and get instant Jira/Linear tickets – no typing
flikhq.com·9h·
Discuss: Hacker News
vibe-coding
Flag this post
Hosting NVIDIA speech NIM models on Amazon SageMaker AI: Parakeet ASR
aws.amazon.com·2d
🏗️AI Infrastructure
Flag this post
Daily Artificial Intelligence Digest - Oct 31, 2025
dev.to·3h·
Discuss: DEV
🏗️AI Infrastructure
Flag this post
Writing an LLM from scratch, part 25 – instruction fine-tuning
gilesthomas.com·1d·
Discuss: Hacker News
🏗️AI Infrastructure
Flag this post
Challenges in Building Natural, Low‑Latency, Reliable Voice Assistants
hackernoon.com·22h
🎤Voice Interfaces
Flag this post
VoxScribe: A platform to test Opensource Speech-to-Text models
blog.devops.dev·2d
🗣️Speech Synthesis
Flag this post