๐Ÿฟ๏ธ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
๐Ÿ—ฃ๏ธ CMU Pronouncing

Phonetic Dictionaries, Speech Synthesis, Linguistic Resources, Audio Processing

Claude Code's 19 cent Parser
blogger.comยท1d
๐Ÿ”งBinary Parsers
Paradigms of Intelligence Team
github.comยท3hยท
Discuss: Hacker News
๐Ÿ”ฒCellular Automata
Constrained Prompt Enhancement for Improving Zero-Shot Generalization of Vision-Language Models
arxiv.orgยท12h
๐Ÿง Neural Codecs
Dissonance: A journey through musical possibility space
aatishb.comยท3dยท
Discuss: Hacker News
๐ŸŒˆSpectral Audio
On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization
arxiv.orgยท12h
๐ŸงฎKolmogorov Bounds
Learning JavaScript Promises the Feynman Way (With AI Assistance)
jakeworth.comยท3hยท
Discuss: Hacker News
โš”๏ธLean Tactics
Mini-Omni-Reasoner: Token-Level Thinking-in-Speaking in Large Speech Models
arxiv.orgยท1d
๐ŸŽ™๏ธWhisper
Show HN: Cosmic AI Platform โ€“ Build and deploy CMS sites using natural language
cosmicjs.comยท1hยท
Discuss: Hacker News
๐Ÿ”—Hypermedia APIs
Nvidia Release Massive AI-Ready Open European Language Dataset and Tools
hardware.slashdot.orgยท2d
๐ŸŽ™๏ธWhisper
Fine-Tuning and Deploying GPT Models Using Hugging Face Transformers
blog.jetbrains.comยท1d
๐Ÿค–Grammar Induction
The Impact of Visual Segmentation on Lexical Word Recognition
arxiv.orgยท12h
๐Ÿ“„OCR
Show HN: Voice Typing from Your Terminal
github.comยท2dยท
Discuss: Hacker News
๐ŸŽ™๏ธWhisper
CausalSent: Interpretable Sentiment Classification with RieszNet
arxiv.orgยท12h
โš–๏ธFeed Ranking
Virtual Reality in Sign Language Education: Opportunities, Challenges, and the Road Ahead
arxiv.orgยท12h
โœ‹Tactile Computing
A XAI-based Framework for Frequency Subband Characterization of Cough Spectrograms in Chronic Respiratory Disease
arxiv.orgยท1d
๐Ÿ“ŠSpectrograms
Next-gen voice, video, and chat messaging using your domain name not your number
thunderbolt.comยท2dยท
Discuss: Hacker News
๐Ÿ”ŒOperating system internals
Toward Socially Aware Vision-Language Models: Evaluating Cultural Competence Through Multimodal Story Generation
arxiv.orgยท12h
๐ŸŒCultural Algorithms
Positional Embeddings in Transformers: A Math Guide to RoPE & ALiBi
towardsdatascience.comยท2h
๐Ÿ“Geometric Hashing
Test-Time Scaling Strategies for Generative Retrieval in Multimodal Conversational Recommendations
arxiv.orgยท12h
๐Ÿ”Information Retrieval
TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models
arxiv.orgยท12h
๐Ÿค–Advanced OCR
Loading...Loading more...
AboutBlogChangelogRoadmap