🐿️ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🗣️ CMU Pronouncing

Phonetic Dictionaries, Speech Synthesis, Linguistic Resources, Audio Processing

Show HN: Requests-Based Google Maps Scraper
apify.com·7h·
Discuss: Hacker News
🔍BitFunnel
Stop Words Using Spacy - NLP
dev.to·1d·
Discuss: DEV
📝Text Parsing
Learning to assess subjective impressions from speech
arxiv.org·23h
👁️Perceptual Coding
Machine Learning Fundamentals: active learning tutorial
dev.to·11h·
Discuss: DEV
🎛️Feed Filtering
Breaking the Transcription Bottleneck: Fine-tuning ASR Models for Extremely Low-Resource Fieldwork Languages
arxiv.org·1d
🎵Audio ML
Mind the Gap: Assessing Wiktionary's Crowd-Sourced Linguistic Knowledge on Morphological Gaps in Two Related Languages
arxiv.org·1d
🔤Morphological Analysis
The Bitter Lesson is coming for Tokenization
lucalp.dev·1d·
Discuss: Lobsters, Hacker News, r/programming
🔗Monadic Parsing
OpusLM: A Family of Open Unified Speech Language Models
arxiv.org·1d
🎵Audio ML
Generalizing Vision-Language Models to Novel Domains: A Comprehensive Survey
arxiv.org·1d
🤖Advanced OCR
A Complete Guide to Retrieval-Augmented Generation
dev.to·2d·
Discuss: DEV
🌀Brotli Internals
SLAP: Siamese Language-Audio Pretraining Without Negative Samples for Music Understanding
arxiv.org·1d
🎵Audio ML
One Does Not Simply 'Mm-hmm': Exploring Backchanneling in the AAC Micro-Culture
arxiv.org·1d
👂Psychoacoustics
JIS: A Speech Corpus of Japanese Idol Speakers with Various Speaking Styles
arxiv.org·1d
🇯🇵Japanese Computing
Patterns for Compounding the Value of LLM interactions
spin.atomicobject.com·15h·
Discuss: Hacker News
🔗Constraint Handling
Machine Learning Fundamentals: active learning project
dev.to·12h·
Discuss: DEV
🧠Machine Learning
Unleashing Creativity with ElevenLabs: A Developer’s Guide to AI Voice Technology
dev.to·20h·
Discuss: DEV
🎙️Whisper
Scaffolding Dexterous Manipulation with Vision-Language Models
arxiv.org·23h
🤖Advanced OCR
ElevenLabs Launches Mobile App for Voice Generation on iOS and Android
dev.to·11h·
Discuss: DEV
🎙️Whisper
MATE: LLM-Powered Multi-Agent Translation Environment for Accessibility Applications
arxiv.org·23h
🎙️Whisper
DuetGen: Music Driven Two-Person Dance Generation via Hierarchical Masked Modeling
arxiv.org·1d
✨Effect Systems
Loading...Loading more...
AboutBlogChangelogRoadmap