๐Ÿฟ๏ธ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
๐Ÿ‘‚ Psychoacoustic Coding

Perceptual Audio, Masking Models, Hearing Science, Lossy Compression

Unsupervised Multi-channel Speech Dereverberation via Diffusion
arxiv.orgยท1h
๐Ÿ‘‚Psychoacoustics
Trainable Dynamic Mask Sparse Attention
arxiv.orgยท1h
๐Ÿ“ŠLearned Metrics
PQCSA: A Gentle Introduction to Code Based PKE
esat.kuleuven.beยท18h
โš—๏ธAlgebraic Coding
Show HN: Open-source Voice Cloning at 16x real-time: Porting Chatterbox to vLLM
github.comยท1dยท
Discuss: Hacker News, r/LocalLLaMA
๐ŸŽฎGameboy Emulation
SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation
arxiv.orgยท1d
๐Ÿ‘‚Psychoacoustics
Beyond Manually Designed Pruning Policies with Second-Level Performance Prediction: A Pruning Framework for LLMs
arxiv.orgยท1h
๐Ÿ’ปLocal LLMs
Hearing More with Less: Multi-Modal Retrieval-and-Selection Augmented Conversational LLM-Based ASR
arxiv.orgยท1h
๐ŸŽตAudio ML
Title: Understanding LayerNorm and RMS Norm in Transformer Models
dev.toยท4hยท
Discuss: DEV
๐Ÿ“ŠQuantization
Context Guided Transformer Entropy Modeling for Video Compression
arxiv.orgยท1h
๐Ÿง Learned Codecs
A comprehensive taxonomy of hallucinations in Large Language Models
arxiv.orgยท1h
๐ŸŒณContext free grammars
Modality Bias in LVLMs: Analyzing and Mitigating Object Hallucination via Attention Lens
arxiv.orgยท1h
๐Ÿ“ŠRate-Distortion Theory
SAT Requires Exhaustive Search
link.springer.comยท8hยท
Discuss: Hacker News
๐ŸงฎKolmogorov Complexity
Beamformed 360{\deg} Sound Maps: U-Net-Driven Acoustic Source Segmentation and Localization
arxiv.orgยท1d
๐ŸŽตSound Archaeology
Non-Verbal Vocalisations and their Challenges: Emotion, Privacy, Sparseness, and Real Life
arxiv.orgยท1h
๐ŸŽ™๏ธWhisper
MAP: Mitigating Hallucinations in Large Vision-Language Models with Map-Level Attention Processing
arxiv.orgยท1h
๐Ÿง Learned Codecs
Information Rates of Approximate Message Passing for Bandlimited Direct-Detection Channels
arxiv.orgยท1h
โš›๏ธQuantum Compression
Filtering with Self-Attention and Storing with MLP: One-Layer Transformers Can Provably Acquire and Extract Knowledge
arxiv.orgยท1h
๐Ÿง Machine Learning
Is AI Really Making Us Dumber?
dev.toยท5hยท
Discuss: DEV
๐Ÿค–Grammar Induction
Mobile AI with ONNX Runtime: How to Build Real-Time Noise Suppression That Works
hackernoon.comยท1d
โšกModern Compression
MUTE-DSS: A Digital-Twin-Based Decision Support System for Minimizing Underwater Radiated Noise in Ship Voyage Planning
arxiv.orgยท1h
โš™๏ธTape Engineering
Loading...Loading more...
AboutBlogChangelogRoadmap