speech-to-text, model, local, whisper, whisper.cpp, input, voice, recognition, ai

NaturalVoices: A Large-Scale, Spontaneous and Emotional Podcast Dataset for Voice Conversion
arxiv.org·10h
🎵Audio Formats
I Use AI
ben.stolovitz.com·26m·
Discuss: Hacker News
Proof Automation
It Doesn’t Need to Be a Chatbot
towardsdatascience.com·13h
🎛️Feed Filtering
Spatial Sense: Unleashing Language Models on Location Data by Arvind Sundararajan
dev.to·18h·
Discuss: DEV
🔶Voronoi Diagrams
PAINT25 Invited Talk transcript: “Notational Freedom via Self-Raising Diagrams”
programmingmadecomplicated.wordpress.com·2h
📝Concrete Syntax
AI Uses Functions to Fetch Real Data (Not Just Chat)
farukalpay.substack.com·1h·
Discuss: Substack
Algebraic Effects
Portable Ai Voice Assistant
hackster.io·2d
🎵Audio ML
How to Use Multimodal AI Models With Docker Model Runner
docker.com·1d
📦Container Security
How We Built a Custom Vision LLM to Improve Document Processing at Grab
engineering.grab.com·15h·
Discuss: Hacker News
🤖Advanced OCR
Engineering.ai: A Platform for Teams of AI Engineers in Computational Design
arxiv.org·10h
Incremental Computation
Practical Steps Towards Vibe Writing with AI Positron
blog.oxygenxml.com·2d
🔄Language Evolution
Incremental Compilation in Recursive‑Descent Parser (Roslyn)
langdev.stackexchange.com·1d·
Discuss: Hacker News
🌳Incremental Parsing
Speech-DRAME: A Framework for Human-Aligned Benchmarks in Speech Role-Play
arxiv.org·10h
📝ABNF Parsing
Contrastive Knowledge Transfer and Robust Optimization for Secure Alignment of Large Language Models
arxiv.org·1d
💻Local LLMs
📞 I'm Not a Coder but Used Claude to Build a Free AI Answering Service
dev.to·9h·
Discuss: DEV
🌀Brotli Internals
From hours to seconds: AI tools to detect animal calls
seangoedecke.com·2d·
Discuss: Hacker News
🎵Audio ML
Introducing Agent-o-rama: build, trace, evaluate, and monitor stateful LLM agents in Java or Clojure
blog.redplanetlabs.com·21h·
Discuss: Hacker News
🌊Streaming Systems
LongCat-Flash-Omni Technical Report
arxiv.org·10h
🎬WebCodecs
LangChain vs LangGraph: A Beginner’s Guide to Building Smarter AI Workflows
hackernoon.com·23h
🔗Topological Sorting
Challenge: Improve Multilingual ASR Performance for Mozilla
community.mozilladatacollective.com·1d·
Discuss: Hacker News
🗣️CMU Pronouncing