HiFi-HARP: A High-Fidelity 7th-Order Ambisonic Room Impulse Response Dataset
arxiv.orgยท1d
๐Ÿ‘‚Psychoacoustic Coding
Flag this post
Enhanced Spectral Decomposition for High-Dimensional Bio-Signal Classification
dev.toยท5hยท
Discuss: DEV
๐ŸŒˆSpectral Methods
Flag this post
The 4 best all-new headphones I listened to at the Paris Audio Show 2025
techradar.comยท1d
๐ŸŽต8-track Revival
Flag this post
Project AV and Unscripted bring AV and design together
madcornishprojectionist.co.ukยท21h
๐ŸŽฌAV1 Encoding
Flag this post
Swift Sings: MDX-Net Vocal Splits and RVC Voice Conversion On-Device with ONNX/CoreML
web.navan.devยท1d
๐ŸŽงLearned Audio
Flag this post
50-Year-Old Mystery Solved? Scientists Uncover Why People with Schizophrenia โ€œHear Voicesโ€
scitechdaily.comยท1d
๐ŸŽตMusic Universality
Flag this post
Taming Text-to-Sounding Video Generation via Advanced Modality Condition andInteraction
dev.toยท1dยท
Discuss: DEV
๐Ÿง Neural Compression
Flag this post
The Best Audio Interfaces of 2025: Universal Audio and More
wired.comยท3d
๐ŸŽงAudio Mastering
Flag this post
LibriConvo: Simulating Conversations from Read Literature for ASR and Diarization
arxiv.orgยท2h
๐ŸŽตAudio ML
Flag this post
Active Noise Cancellation via Multi-Modal Acoustic Surface Metamaterial Structures for In-Cabin Vehicle Acoustics
dev.toยท6hยท
Discuss: DEV
๐Ÿ”งCassette Engineering
Flag this post
RatioWaveNet: A Learnable RDWT Front-End for Robust and Interpretable EEG Motor-Imagery Classification
arxiv.orgยท2h
๐Ÿ“ŠLearned Metrics
Flag this post
How to Apply Powerful AI Audio Models to Real-World Applications
towardsdatascience.comยท10h
๐ŸŽตAudio ML
Flag this post
Frequency-Spatial Interaction Driven Network for Low-Light Image Enhancement
arxiv.orgยท2h
๐Ÿ“ŠRate-Distortion Theory
Flag this post
Compositional Bias Control in Large Language Models: Preference Learning Fails, Supervision Succeeds
arxiv.orgยท2h
๐ŸŽ›๏ธFeed Filtering
Flag this post
Audio Frequency-Time Dual Domain Evaluation on Depression Diagnosis
arxiv.orgยท2h
๐Ÿ“ŠSpectrograms
Flag this post
The Limits of Data Scaling: Sub-token Utilization and Acoustic Saturation in Multilingual ASR
arxiv.orgยท2h
๐ŸŽ™๏ธWhisper
Flag this post
Foley Control: Aligning a Frozen Latent Text-to-Audio Model to Video
arxiv.orgยท1d
๐ŸŽงVorbis Encoding
Flag this post
Part 1: Training a Neural Network to Detect Coffee First Crack from Audio - An Agentic Development Journey with Warp
dev.toยท9hยท
Discuss: DEV
๐Ÿ”FLAC Forensics
Flag this post