LibriConvo: Simulating Conversations from Read Literature for ASR and Diarization
arxiv.org·9h
🎵Audio ML
Flag this post
Scripts That Don’t Fit: The Hidden Bias of NLP in South Asian Languages
digitalorientalist.com·25m
🏛Digital humanities
Flag this post
How to Apply Powerful AI Audio Models to Real-World Applications
towardsdatascience.com·17h
🎵Audio ML
Flag this post
New method can measure ocean acidification using ambient wind noise
phys.org·1h
📡Frequency Archaeology
Flag this post
Swift Sings: MDX-Net Vocal Splits and RVC Voice Conversion On-Device with ONNX/CoreML
web.navan.dev·1d
🎧Learned Audio
Flag this post
Do We Still Need OCR?
🤖Advanced OCR
Flag this post
How to Get to The End of a Pile of Unread Books
candost.blog·11h
📃Manuscript Tokenization
Flag this post
The Reel Deal: Why Audiovisual Heritage Matters
dpconline.org·16h
🏺Media Archaeology
Flag this post
Wednesday 26 November - 11am
informatics.ed.ac.uk·3h
🎵Audio ML
Flag this post
A Sociophonetic Analysis of Racial Bias in Commercial ASR Systems Using the Pacific Northwest English Corpus
arxiv.org·9h
🗣️CMU Pronouncing
Flag this post
Fix Your Paper Reading Game
jalexine.github.io·7h
🔬Academic Search
Flag this post
AI and the End of Accents
wired.com·1d
🗣️CMU Pronouncing
Flag this post
Medical Speech AI Platform: Corti Gears Up for Psychiatry and More
heise.de·23h
🎵Audio ML
Flag this post
The Limits of Data Scaling: Sub-token Utilization and Acoustic Saturation in Multilingual ASR
arxiv.org·9h
🎙️Whisper
Flag this post
Text to Speech Sam
🗣️CMU Pronouncing
Flag this post
Show HN: The Σ-Manifold Manifesto
🏛Digital humanities
Flag this post
"the densest and longest lasting human readable information storage media"
🌡️Preservation Physics
Flag this post
Loading...Loading more...