HiFi-HARP: A High-Fidelity 7th-Order Ambisonic Room Impulse Response Dataset
arxiv.orgยท1d
๐Psychoacoustic Coding
Flag this post
Enhanced Spectral Decomposition for High-Dimensional Bio-Signal Classification
๐Spectral Methods
Flag this post
The 4 best all-new headphones I listened to at the Paris Audio Show 2025
techradar.comยท1d
๐ต8-track Revival
Flag this post
Project AV and Unscripted bring AV and design together
madcornishprojectionist.co.ukยท21h
๐ฌAV1 Encoding
Flag this post
Swift Sings: MDX-Net Vocal Splits and RVC Voice Conversion On-Device with ONNX/CoreML
web.navan.devยท1d
๐งLearned Audio
Flag this post
50-Year-Old Mystery Solved? Scientists Uncover Why People with Schizophrenia โHear Voicesโ
scitechdaily.comยท1d
๐ตMusic Universality
Flag this post
Taming Text-to-Sounding Video Generation via Advanced Modality Condition andInteraction
๐ง Neural Compression
Flag this post
The Best Audio Interfaces of 2025: Universal Audio and More
wired.comยท3d
๐งAudio Mastering
Flag this post
LibriConvo: Simulating Conversations from Read Literature for ASR and Diarization
arxiv.orgยท2h
๐ตAudio ML
Flag this post
Active Noise Cancellation via Multi-Modal Acoustic Surface Metamaterial Structures for In-Cabin Vehicle Acoustics
๐งCassette Engineering
Flag this post
RatioWaveNet: A Learnable RDWT Front-End for Robust and Interpretable EEG Motor-Imagery Classification
arxiv.orgยท2h
๐Learned Metrics
Flag this post
How to Apply Powerful AI Audio Models to Real-World Applications
towardsdatascience.comยท10h
๐ตAudio ML
Flag this post
Frequency-Spatial Interaction Driven Network for Low-Light Image Enhancement
arxiv.orgยท2h
๐Rate-Distortion Theory
Flag this post
Compositional Bias Control in Large Language Models: Preference Learning Fails, Supervision Succeeds
arxiv.orgยท2h
๐๏ธFeed Filtering
Flag this post
Audio Frequency-Time Dual Domain Evaluation on Depression Diagnosis
arxiv.orgยท2h
๐Spectrograms
Flag this post
Automated Tinnitus Detection Through Dual-Modality Neuroimaging: EEG Microstate Analysis and Resting-State fMRI Classification Using Deep Learning
arxiv.orgยท2h
๐Spectral Audio
Flag this post
The Limits of Data Scaling: Sub-token Utilization and Acoustic Saturation in Multilingual ASR
arxiv.orgยท2h
๐๏ธWhisper
Flag this post
Foley Control: Aligning a Frozen Latent Text-to-Audio Model to Video
arxiv.orgยท1d
๐งVorbis Encoding
Flag this post
Variational autoencoders stabilise TCN performance when classifying weakly labelled bioacoustics data: an interdisciplinary approach
arxiv.orgยท1d
๐ง Learned Codecs
Flag this post
Loading...Loading more...