🐿️ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
πŸ‘‚ Psychoacoustic Coding

Perceptual Audio, Masking Models, Hearing Science, Lossy Compression

Ripping CDs the old way
thefoggiest.devΒ·19h
πŸ“ΌCassette Archaeology
Testing AirPods 4’s Beta Update and Improved Recording Quality for Voice Notes
macstories.netΒ·1d
🎡Audio Codecs
Oblique Strategies for Vibe Coding
useyourexperience.comΒ·2dΒ·
Discuss: Hacker News
πŸ‡ΈπŸ‡ͺNordic Algorithms
Next iteration of our Voice Assistant is here - Voice chapter 10
home-assistant.ioΒ·1dΒ·
Discuss: Hacker News
πŸŽ™οΈWhisper
Show HN: TableSprint- Supabase alternative with vibe coding features
tablesprint.comΒ·1dΒ·
Discuss: Hacker News
πŸ“²Digitization
Challenging projects every programmer should try
austinhenley.comΒ·4hΒ·
Discuss: Hacker News
πŸ“Compiler Design
Robust Foreground-Background Separation for Severely-Degraded Videos Using Convolutional Sparse Representation Modeling
arxiv.orgΒ·2d
πŸ‘οΈPerceptual Hashing
I tried to mimic the human brain to rethink how AI works – Introducing NeuroCode
dev.toΒ·3hΒ·
Discuss: DEV
πŸ”²Cellular Automata
Towards Interpretable Adversarial Examples via Sparse Adversarial Attack
arxiv.orgΒ·2d
πŸ•΅οΈVector Smuggling
OpusLM: A Family of Open Unified Speech Language Models
arxiv.orgΒ·2d
🎡Audio ML
ReMAR-DS: Recalibrated Feature Learning for Metal Artifact Reduction and CT Domain Transformation
arxiv.orgΒ·1d
πŸ“ŠLearned Metrics
Splitformer: An improved early-exit architecture for automatic speech recognition on edge devices
arxiv.orgΒ·2d
🌊Streaming Compression
Visual hallucination detection in large vision-language models via evidential conflict
arxiv.orgΒ·1d
πŸ“ŠLearned Metrics
SLAP: Siamese Language-Audio Pretraining Without Negative Samples for Music Understanding
arxiv.orgΒ·2d
🎡Audio ML
CCRS: A Zero-Shot LLM-as-a-Judge Framework for Comprehensive RAG Evaluation
arxiv.orgΒ·15h
πŸ“Linear Logic
Rethinking Mean Opinion Scores in Speech Quality Assessment: Aggregation through Quantized Distribution Fitting
arxiv.orgΒ·2d
🎡Audio ML
FilMaster: Bridging Cinematic Principles and Generative AI for Automated Film Generation
arxiv.orgΒ·2dΒ·
Discuss: Hacker News
🧠Neural Codecs
Face-Voice Association for Audiovisual Active Speaker Detection in Egocentric Recordings
arxiv.orgΒ·2d
πŸ”ŠAcoustic Forensics
Automatic Depression Assessment using Machine Learning: A Comprehensive Survey
arxiv.orgΒ·1d
🎡Audio ML
HunyuanVideo-Avatar: The Breakthrough That’s Revolutionizing AI-Driven Human Animation
dev.toΒ·1dΒ·
Discuss: DEV
🎬AV1 Encoding
Loading...Loading more...
AboutBlogChangelogRoadmap