Speech Recognition, Music Information Retrieval, Acoustic Modeling, Sound Classification

High-Tech Sensors Expose the Secret Tricks of Piano Masters
scitechdaily.com·1d
🌈Spectral Audio
[P] Lossless compression for 1D CNNs
reddit.com·21h
📊Quantization
My First Week of Vibecoding
underreacted.leaflet.pub·5h
🎯Gradual Typing
CIR-CoT: Towards Interpretable Composed Image Retrieval via End-to-End Chain-of-Thought Reasoning
arxiv.org·1d
🧮Vector Embeddings
StruSR: Structure-Aware Symbolic Regression with Physics-Informed Taylor Guidance
arxiv.org·2d
🧠Machine Learning
Revisiting Mixout: An Overlooked Path to Robust Finetuning
arxiv.org·2d
🧠Learned Codecs
Krish Naik: Complete RAG Crash Course With Langchain In 2 Hours
dev.to·1d
📊Multi-vector RAG
Unraveling LCRE-Mediated Chromatin Loops: A Predictive Model for Gene Expression Fine-Tuning in Desert Genomes
dev.to·8h
📥Feed Aggregation
Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding
arxiv.org·2d
🎬AV1 Encoding
Improving Temporal Understanding Logic Consistency in Video-Language Models via Attention Enhancement
arxiv.org·1d
📊Learned Metrics
Synergy Between the Strong and the Weak: Spiking Neural Networks are Inherently Self-Distillers
arxiv.org·1d
🧠Machine Learning
Harnessing LLM for Noise-Robust Cognitive Diagnosis in Web-Based Intelligent Education Systems
arxiv.org·4d
🧠Intelligence Compression
From RNNs to ChatGPT: The Paper That Changed How AI Thinks 🤖
dev.to·15h
🎧Learned Audio
Nearest Neighbor CCP-Based Molecular Sequence Analysis
arxiv.org·1d
🔄Burrows-Wheeler
H1B-KV: Hybrid One-Bit Caches for Memory-Efficient Large Language Model Inference
arxiv.org·3d
💨Cache Optimization
Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management
arxiv.org·2d
💻Local LLMs
Utilizing Information Theoretic Approach to Study Cochlear Neural Degeneration
arxiv.org·2d
👂Psychoacoustic Coding