🐿️ Scour
🗣️ CMU Pronouncing

Phonetic Dictionaries, Speech Synthesis, Linguistic Resources, Audio Processing

Zero-shot Context Biasing with Trie-based Decoding using Synthetic Multi-Pronunciation
arxiv.org·15h
🎙️Whisper
VibeVoice (1.5B) - TTS model by Microsoft
huggingface.co·1d·
Discuss: Hacker News, r/LocalLLaMA
🎙️Whisper
Google NotebookLM goes global with multilingual AI video summaries of your notes
techradar.com·16h
🏛Digital humanities
Google’s URL Context Grounding: Another Nail in RAG’s Coffin?
towardsdatascience.com·6h
🌀Brotli Internals
MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols
arxiv.org·15h
🎙️Whisper
Clay Shirky: The Only Real Solution to the A.I. Cheating Crisis
nytimes.com·5h·
Discuss: Hacker News
🎯Interactive Provers
Google Translate's latest feature is its take on Duolingo
engadget.com·3h
🎙️Whisper
TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling
arxiv.org·15h
🎙️Whisper
A Universal Rhythm Guides How We Speak: Global Analysis Reveals 1.6-second Units
science.slashdot.org·1d
🎵Music Universality
Using Real Survey Data to Create Authentic AI Personas for Extended Research
askrally.com·6h·
Discuss: Hacker News
🏛Digital humanities
Benchmarking GPT-5 vs Claude 4 Sonnet on 200 Requests
dev.to·4h·
Discuss: DEV
⚙️Compression Benchmarking
Wan-S2V: Audio-Driven Cinematic Video Generation
humanaigc.github.io·4h·
Discuss: Hacker News
⏱️SMPTE Timecode
Speech-Based Depressive Mood Detection in the Presence of Multiple Sclerosis: A Cross-Corpus and Cross-Lingual Study
arxiv.org·15h
🎙️Whisper
Show HN: Unlingo – open-source platform for localization
unlingo.com·2h·
Discuss: Hacker News
🇯🇵Japanese Computing
Wubular: Rubular Reimagined in Ruby+WASM
rubyelders.com·5h·
Discuss: Hacker News
🌀Brotli Internals
LingVarBench: Benchmarking LLM for Automated Named Entity Recognition in Structured Synthetic Spoken Transcriptions
arxiv.org·1d
⚙️Compression Benchmarking
EyeMulator: Improving Code Language Models by Mimicking Human Visual Attention
arxiv.org·15h
📊Feed Optimization
New AI-powered live translation and language learning tools in Google Translate
blog.google·3h·
Discuss: Hacker News, r/Android
🤖AI Translation
Using Gemini prompts for Suno's Cover/Remix helps unblock creative projects
backpocketmusic.com·19h·
Discuss: Hacker News
🎧Learned Audio
DocHop-QA: Towards Multi-Hop Reasoning over Multimodal Document Collections
arxiv.org·1d
📇Dublin Core