🎵 Audio ML - matmat · Scour

LoRA Explained: Faster, More Efficient Fine-Tuning with Docker

docker.com·1d

Evaluating Small Vision-Language Models on Distance-Dependent Traffic Perception

arxiv.org·17h

🧠Machine Learning

Fully Configurable Open Source Audio Spectrum Analyzer

dev.to·2d·

Discuss: DEV

🌈Spectral Audio

Neu-RadBERT for Enhanced Diagnosis of Brain Injuries and Conditions

arxiv.org·1d

🔍Vector Forensics

Instance Relation Learning Network with Label Knowledge Propagation for Few-shot Multi-label Intent Detection

arxiv.org·17h

🤖Grammar Induction

LLM Optimization Notes: Memory, Compute and Inference Techniques

gaurigupta19.github.io·4d·

Discuss: Hacker News

microsoft/UserLM-8b - “Unlike typical LLMs that are trained to play the role of the 'assistant' in conversation, we trained UserLM-8b to simulate the 'user' rol...

huggingface.co·1d·

Discuss: DEV, Hacker News, r/LocalLLaMA

LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?

arxiv.org·17h

🔗Parser Combinators

Enhancing Underwater Acoustic Communication via Adaptive Beamforming and Deep Learning Noise Cancellation

dev.to·16h·

Discuss: DEV

🎧Learned Audio

Detecting and Mitigating Insertion Hallucination in Video-to-Audio Generation

arxiv.org·17h

👁️Perceptual Coding

CaRT: Teaching LLM Agents to Know When They Know Enough

arxiv.org·17h

🔲Cellular Automata

Less Is More: Recursive Reasoning with Tiny Networks

github.com·2d·

Discuss: Hacker News

📊Quantization

LogSTOP: Temporal Scores over Prediction Sequences for Matching and Retrieval

arxiv.org·1d

📊Learned Metrics

AbsoluteNet: A Deep Learning Neural Network to Classify Cerebral Hemodynamic Responses of Auditory Processing

arxiv.org·1d

👂Psychoacoustics

TaoSR-SHE: Stepwise Hybrid Examination Reinforcement Learning Framework for E-commerce Search Relevance

arxiv.org·17h

🔍Information Retrieval

MultiCNKG: Integrating Cognitive Neuroscience, Gene, and Disease Knowledge Graphs Using Large Language Models

arxiv.org·1d

🧠Intelligence Compression

MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding

arxiv.org·17h

🧠Learned Codecs

Krish Naik: Complete RAG Crash Course With Langchain In 2 Hours

dev.to·13h·

Discuss: DEV

Expanding the Action Space of LLMs to Reason Beyond Language

arxiv.org·17h

ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping

arxiv.org·17h

🧮Kolmogorov Complexity

Loading more...