Speech Recognition, Music Information Retrieval, Acoustic Modeling, Sound Classification

ACMID: Automatic Curation of Musical Instrument Dataset for 7-Stem Music Source Separation
arxiv.orgยท12h
๐ŸŽตAudio Formats
Show HN: Nanowakeword โ€“ Automates custom wake word model training
github.comยท5hยท
Discuss: Hacker News
๐ŸŽ™๏ธWhisper
The key to conversational speech recognition
datasciencecentral.comยท22h
๐ŸŽ™๏ธWhisper
From Documents to Dialogue: A step-by-step RAG Journey
dev.toยท2hยท
Discuss: DEV
๐Ÿ“ŠMulti-vector RAG
INFER : Learning Implicit Neural Frequency Response Fields for Confined Car Cabin
arxiv.orgยท12h
๐Ÿ‘‚Psychoacoustic Coding
Neuro-Symbolic AI
en.wikipedia.orgยท2hยท
Discuss: Hacker News
๐Ÿ”ฒCellular Automata
Show HN: AI Voice AudioBook โ€“ Convert ebooks to audio with your cloned voice
zan.chatยท3hยท
Discuss: Hacker News
๐ŸŽ™๏ธWhisper
Can Speech LLMs Think while Listening?
arxiv.orgยท12h
๐ŸŽ™๏ธWhisper
How the Rise of Tabular Foundation Models Is Reshaping Data Science
towardsdatascience.comยท1d
๐Ÿง Machine Learning
In-Depth Analysis: "Attention Is All You Need"
dev.toยท1hยท
Discuss: DEV
๐Ÿง Intelligence Compression
RND1: Simple, Scalable AR-to-Diffusion Conversion
radicalnumerics.aiยท21hยท
Discuss: Hacker News
๐Ÿ’ปLocal LLMs
MSF-SER: Enriching Acoustic Modeling with Multi-Granularity Semantics for Speech Emotion Recognition
arxiv.orgยท2d
๐ŸŽ™๏ธWhisper
How to Teach Large Multimodal Models New Skills
arxiv.orgยท12h
๐Ÿ“ŠLearned Metrics
SPAD: Specialized Prefill and Decode Hardware for Disaggregated LLM Inference
arxiv.orgยท12h
๐Ÿ’ปLocal LLMs
Fully Configurable Open Source Audio Spectrum Analyzer
dev.toยท1dยท
Discuss: DEV
๐ŸŒˆSpectral Audio
Audio-Visual Separation with Hierarchical Fusion and Representation Alignment
arxiv.orgยท12h
๐Ÿ’ฟFLAC Archaeology
A small number of samples can poison LLMs of any size
dev.toยท14hยท
Discuss: DEV
๐Ÿ’ปLocal LLMs
LoRA Explained: Faster, More Efficient Fine-Tuning with Docker
docker.comยท1d
๐Ÿ’ปLocal LLMs
Evaluating Small Vision-Language Models on Distance-Dependent Traffic Perception
arxiv.orgยท12h
๐Ÿง Machine Learning