Know Audio: Lossy Compression Algorithms And Distortion
hackaday.com·9h
Audio Codecs
Linear Audio Dreams: Injecting Sanity into Autoencoder Latent Spaces by Arvind Sundararajan
Learned Audio
I'm a recording musician and these are my favorite headphones (so you can look for them on Black Friday)
techradar.com·11h
Audio Cassettes
Bridging Minds and Machines
Neural Codecs
HiFi-HARP: A High-Fidelity 7th-Order Ambisonic Room Impulse Response Dataset
arxiv.org·1d
Psychoacoustic Coding
Swift Sings: MDX-Net Vocal Splits and RVC Voice Conversion On-Device with ONNX/CoreML
web.navan.dev·2d
Learned Audio
Active Noise Cancellation via Multi-Modal Acoustic Surface Metamaterial Structures for In-Cabin Vehicle Acoustics
Cassette Engineering
LibriConvo: Simulating Conversations from Read Literature for ASR and Diarization
arxiv.org·22h
Audio ML
Enhanced Spectral Decomposition for High-Dimensional Bio-Signal Classification
Spectral Methods
RatioWaveNet: A Learnable RDWT Front-End for Robust and Interpretable EEG Motor-Imagery Classification
arxiv.org·22h
Learned Metrics
How to Apply Powerful AI Audio Models to Real-World Applications
towardsdatascience.com·1d
Audio ML
Calligraphers and Storytellers
Japanese Computing
How to Transcribe Lectures Longer Than 2 Hours Without Time Limits?
Document Phonetics
The Best Audio Interfaces of 2025: Universal Audio and More
wired.com·4d
Audio Mastering
Quantifying Somatic Marker Correlations in Guided Mindfulness Meditation via Bio-Acoustic Analysis and Deep Neural Networks
Spectral Audio
Frequency-Spatial Interaction Driven Network for Low-Light Image Enhancement
arxiv.org·22h
Rate-Distortion Theory
Compositional Bias Control in Large Language Models: Preference Learning Fails, Supervision Succeeds
arxiv.org·22h
Feed Filtering
Audio Frequency-Time Dual Domain Evaluation on Depression Diagnosis
arxiv.org·22h
Spectrograms
I Used Smart Glasses to Trick a Bartender into Giving Me a Free Drink
Cassette Hacking
VoxScribe: A platform to test Opensource Speech-to-Text models
blog.devops.dev·6h
Whisper