Perceptual Audio, Masking Models, Hearing Science, Lossy Compression
PQCSA: A Gentle Introduction to Code Based PKE
esat.kuleuven.beยท18h
SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation
arxiv.orgยท1d
Beyond Manually Designed Pruning Policies with Second-Level Performance Prediction: A Pruning Framework for LLMs
arxiv.orgยท1h
Hearing More with Less: Multi-Modal Retrieval-and-Selection Augmented Conversational LLM-Based ASR
arxiv.orgยท1h
Modality Bias in LVLMs: Analyzing and Mitigating Object Hallucination via Attention Lens
arxiv.orgยท1h
Beamformed 360{\deg} Sound Maps: U-Net-Driven Acoustic Source Segmentation and Localization
arxiv.orgยท1d
Non-Verbal Vocalisations and their Challenges: Emotion, Privacy, Sparseness, and Real Life
arxiv.orgยท1h
MAP: Mitigating Hallucinations in Large Vision-Language Models with Map-Level Attention Processing
arxiv.orgยท1h
Information Rates of Approximate Message Passing for Bandlimited Direct-Detection Channels
arxiv.orgยท1h
Filtering with Self-Attention and Storing with MLP: One-Layer Transformers Can Provably Acquire and Extract Knowledge
arxiv.orgยท1h
Loading...Loading more...