Know Audio: Lossy Compression Algorithms And Distortion
hackaday.comยท9h
๐ŸŽตAudio Codecs
Flag this post
Linear Audio Dreams: Injecting Sanity into Autoencoder Latent Spaces by Arvind Sundararajan
dev.toยท7hยท
Discuss: DEV
๐ŸŽงLearned Audio
Flag this post
I'm a recording musician and these are my favorite headphones (so you can look for them on Black Friday)
techradar.comยท11h
๐Ÿ“ผAudio Cassettes
Flag this post
Bridging Minds and Machines
ofcarbonandsilicon.substack.comยท3hยท
Discuss: Substack
๐Ÿง Neural Codecs
Flag this post
HiFi-HARP: A High-Fidelity 7th-Order Ambisonic Room Impulse Response Dataset
arxiv.orgยท1d
๐Ÿ‘‚Psychoacoustic Coding
Flag this post
Swift Sings: MDX-Net Vocal Splits and RVC Voice Conversion On-Device with ONNX/CoreML
web.navan.devยท2d
๐ŸŽงLearned Audio
Flag this post
Active Noise Cancellation via Multi-Modal Acoustic Surface Metamaterial Structures for In-Cabin Vehicle Acoustics
dev.toยท1dยท
Discuss: DEV
๐Ÿ”งCassette Engineering
Flag this post
LibriConvo: Simulating Conversations from Read Literature for ASR and Diarization
arxiv.orgยท22h
๐ŸŽตAudio ML
Flag this post
Enhanced Spectral Decomposition for High-Dimensional Bio-Signal Classification
dev.toยท1dยท
Discuss: DEV
๐ŸŒˆSpectral Methods
Flag this post
RatioWaveNet: A Learnable RDWT Front-End for Robust and Interpretable EEG Motor-Imagery Classification
arxiv.orgยท22h
๐Ÿ“ŠLearned Metrics
Flag this post
How to Apply Powerful AI Audio Models to Real-World Applications
towardsdatascience.comยท1d
๐ŸŽตAudio ML
Flag this post
Calligraphers and Storytellers
sibylline.devยท7hยท
Discuss: Hacker News
๐Ÿ‡ฏ๐Ÿ‡ตJapanese Computing
Flag this post
How to Transcribe Lectures Longer Than 2 Hours Without Time Limits?
dev.toยท17hยท
Discuss: DEV
๐Ÿ“„Document Phonetics
Flag this post
The Best Audio Interfaces of 2025: Universal Audio and More
wired.comยท4d
๐ŸŽงAudio Mastering
Flag this post
Quantifying Somatic Marker Correlations in Guided Mindfulness Meditation via Bio-Acoustic Analysis and Deep Neural Networks
dev.toยท9hยท
Discuss: DEV
๐ŸŒˆSpectral Audio
Flag this post
Frequency-Spatial Interaction Driven Network for Low-Light Image Enhancement
arxiv.orgยท22h
๐Ÿ“ŠRate-Distortion Theory
Flag this post
Compositional Bias Control in Large Language Models: Preference Learning Fails, Supervision Succeeds
arxiv.orgยท22h
๐ŸŽ›๏ธFeed Filtering
Flag this post
Audio Frequency-Time Dual Domain Evaluation on Depression Diagnosis
arxiv.orgยท22h
๐Ÿ“ŠSpectrograms
Flag this post
I Used Smart Glasses to Trick a Bartender into Giving Me a Free Drink
lifehacker.comยท11hยท
Discuss: Hacker News
๐Ÿ“ผCassette Hacking
Flag this post
VoxScribe: A platform to test Opensource Speech-to-Text models
blog.devops.devยท6h
๐ŸŽ™๏ธWhisper
Flag this post