Speech Recognition, Music Information Retrieval, Acoustic Modeling, Sound Classification

LoRA Explained: Faster, More Efficient Fine-Tuning with Docker
docker.com·1d
💻Local LLMs
Evaluating Small Vision-Language Models on Distance-Dependent Traffic Perception
arxiv.org·17h
🧠Machine Learning
Fully Configurable Open Source Audio Spectrum Analyzer
dev.to·2d·
Discuss: DEV
🌈Spectral Audio
Neu-RadBERT for Enhanced Diagnosis of Brain Injuries and Conditions
arxiv.org·1d
🔍Vector Forensics
LLM Optimization Notes: Memory, Compute and Inference Techniques
gaurigupta19.github.io·4d·
Discuss: Hacker News
💻Local LLMs
LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?
arxiv.org·17h
🔗Parser Combinators
Enhancing Underwater Acoustic Communication via Adaptive Beamforming and Deep Learning Noise Cancellation
dev.to·16h·
Discuss: DEV
🎧Learned Audio
Detecting and Mitigating Insertion Hallucination in Video-to-Audio Generation
arxiv.org·17h
👁️Perceptual Coding
CaRT: Teaching LLM Agents to Know When They Know Enough
arxiv.org·17h
🔲Cellular Automata
Less Is More: Recursive Reasoning with Tiny Networks
github.com·2d·
Discuss: Hacker News
📊Quantization
LogSTOP: Temporal Scores over Prediction Sequences for Matching and Retrieval
arxiv.org·1d
📊Learned Metrics
TaoSR-SHE: Stepwise Hybrid Examination Reinforcement Learning Framework for E-commerce Search Relevance
arxiv.org·17h
🔍Information Retrieval
MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding
arxiv.org·17h
🧠Learned Codecs
Krish Naik: Complete RAG Crash Course With Langchain In 2 Hours
dev.to·13h·
Discuss: DEV
🔨Compilers
Expanding the Action Space of LLMs to Reason Beyond Language
arxiv.org·17h
💻Local LLMs
ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping
arxiv.org·17h
🧮Kolmogorov Complexity