👂 Psychoacoustics - matmat · Scour

AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders

arxiv.org·23h

🎧Learned Audio

MiniMax Music 2.5: The AI Music Tool That Puts Creators in Control of Studio-Quality Sound

open.forem.com·22h·

Discuss: DEV

🎧Learned Audio

billposer.org·4d

🎵Audio Formats

Audio Components

forums.anandtech.com·1d

🎚️Audio Production

It's all a blur

lcamtuf.substack.com·1d·

Discuss: Substack

📊Rate-Distortion Theory

Linux Audio Developer's Simple Plugin API (LADSPA)

ladspa.org·15h

💿FLAC Archaeology

Sounding Highlights: Dual-Pathway Audio Encoders for Audio-Visual Video Highlight Detection

arxiv.org·1d

👁️Perceptual Coding

Build in a Day: AI Video Clipping with CE.SDK

img.ly·1d

MichiAI: A 530M Full-Duplex Speech LLM with ~75ms Latency Using Flow Matching

ketsuilabs.io·3d·

Discuss: Hacker News

Voxtral transcribes at the speed of sound

simonwillison.net·2d

💿FLAC Archaeology

Discover Anthropic's Claude Opus 4.6: Advanced Agentic Coding Features

dev.to·1d·

Discuss: DEV

📚MARC Evolution

Prompt Fidelity: Measuring How Much of Your Intent an AI Agent Actually Executes

towardsdatascience.com·16h

🔍FLAC Forensics

Game Boy Advance Audio Interpolation

jsgroth.dev·2d·

Discuss: Hacker News

🎵Gameboy Sound

On listening to the space between: narrative causality, parasitical stories, and language models

electricarchaeology.ca·3d

🕸️Hypertext Archaeology

What Spectroscopy Was to the 1800s, Embeddings Are to Science Now

mnky9800n.substack.com·2d·

Discuss: Substack

🧠Machine Learning

AudioCodes Ltd. (AUDC) Q4 2025 Earnings Call Transcript

seekingalpha.com·3d

🎵Audio Codecs

AI ASMR Voice: Free AI ASMR Voice Generator

aiasmrvoice.com·2d·

Discuss: Hacker News

🎧Learned Audio

Future leakage in block-quantized attention

matx.com·4d·

Discuss: Hacker News

📊Vector Quantization

End-to-end Continuous Speech Recognition using Attention-based Recurrent NN:First Results

dev.to·1d·

Discuss: DEV

🎧Learned Audio

charstorm/vilberta: Voice chatbot with voice+screen output to show that "not everything needs to be spoken"

github.com·1d·

Discuss: Hacker News

Loading more...