Speech Recognition, Music Information Retrieval, Acoustic Modeling, Sound Classification

ACMID: Automatic Curation of Musical Instrument Dataset for 7-Stem Music Source Separation
arxiv.org·22h
🎵Audio Formats
A gentle introduction to Generative AI: Historical perspective
medium.com·1h·
Discuss: Hacker News
🧠Learned Codecs
The key to conversational speech recognition
datasciencecentral.com·1d
🎙️Whisper
Show HN: Nanowakeword – Automates custom wake word model training
github.com·14h·
Discuss: Hacker News
🎙️Whisper
From Documents to Dialogue: A step-by-step RAG Journey
dev.to·12h·
Discuss: DEV
📊Multi-vector RAG
Creating Real-Time Multimodal AI Pipelines: Scaling File Processing to 50M Daily Uploads
engineering.salesforce.com·2h
🌊Stream Processing
INFER : Learning Implicit Neural Frequency Response Fields for Confined Car Cabin
arxiv.org·22h
👂Psychoacoustic Coding
YouTube gets ~5% CTR lift on Shorts by replacing embedding tables with Semantic IDs
shaped.ai·1d
📊Feed Optimization
Show HN: AI Voice AudioBook – Convert ebooks to audio with your cloned voice
zan.chat·13h·
Discuss: Hacker News
🎙️Whisper
Revisiting Karpathy's 'Unreasonable Effectiveness of Recurrent Neural Networks'
gilesthomas.com·1h·
Discuss: Hacker News
🎧Learned Audio
Can Speech LLMs Think while Listening?
arxiv.org·22h
🎙️Whisper
How the Rise of Tabular Foundation Models Is Reshaping Data Science
towardsdatascience.com·1d
🧠Machine Learning
🧠 Real-Time Smart Speech Assistant with Python, Whisper & LLMs
dev.to·5h·
Discuss: DEV
🎙️Whisper
In-Depth Analysis: "Attention Is All You Need"
dev.to·10h·
Discuss: DEV
🧠Intelligence Compression
Automated Spectral Fingerprint Deconvolution for Polymer Identification via Deep Oligomer Networks
dev.to·1h·
Discuss: DEV
🌈Spectroscopy
MSF-SER: Enriching Acoustic Modeling with Multi-Granularity Semantics for Speech Emotion Recognition
arxiv.org·2d
🎙️Whisper
How to Teach Large Multimodal Models New Skills
arxiv.org·22h
📊Learned Metrics
SPAD: Specialized Prefill and Decode Hardware for Disaggregated LLM Inference
arxiv.org·22h·
Discuss: r/LLM
💻Local LLMs
Neuro-Symbolic AI
en.wikipedia.org·11h·
Discuss: Hacker News
🔲Cellular Automata