Hume AI Octave 2: new text-to-speech model, 11+ languages
hume.ai·1d·
Discuss: Hacker News
🎙️Whisper
Building an vision language model from scratch
poonai.xyz·3d·
Discuss: Hacker News
🤖Paleographic ML
Building a Command-Line Quiz Application in R
towardsdatascience.com·3h
🐚Shell Calculus
I Do Not Want to Be a Programmer Anymore
mindthenerd.com·3h·
Discuss: Hacker News
Proof Automation
Fine-Tuning Flow Matching via Maximum Likelihood Estimation of Reconstructions
arxiv.org·2d
📊Quantization
Building Ethical AI: A Comprehensive Guide to Responsible Artificial Intelligence
dev.to·2d·
Discuss: DEV
🤖AI Curation
Show HN: TorchSystem, Event driven systems with PyTorch
github.com·1d·
Discuss: Hacker News
Incremental Computation
Sora 2: AI Video Generation with Realistic Sound
2-sora.com·2d·
Discuss: Hacker News
🎧Learned Audio
AudioMoG: Guiding Audio Generation with Mixture-of-Guidance
arxiv.org·5d
🎧Learned Audio
AI-Driven Predictive Maintenance of Microfluidic Injector Arrays for Enhanced Bioreactor Performance
dev.to·14h·
Discuss: DEV
🏠Homelab Orchestration
Hyperdimensional Biomarker Discovery via Probabilistic Causal Graph Optimization
dev.to·4h·
Discuss: DEV
🧠Machine Learning
GFSR-Net: Guided Focus via Segment-Wise Relevance Network for Interpretable Deep Learning in Medical Imaging
arxiv.org·2d
📊Learned Metrics
RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning
arxiv.org·2d
🔲Cellular Automata
BioVERSE: Representation Alignment of Biomedical Modalities to LLMs for Multi-Modal Reasoning
arxiv.org·2d
🌳Context free grammars
Toward a Realistic Encoding Model of Auditory Affective Understanding in the Brain
arxiv.org·6d
👁️Perceptual Coding
Choosing the Right AI Model for Stock Prediction
dev.to·12h·
Discuss: DEV
🧠Machine Learning
Ship Faster Without Breaking Things: DORA 2025 in Real Life
dev.to·1h·
Discuss: DEV
🔄Language Evolution
StableDub: Taming Diffusion Prior for Generalized and Efficient Visual Dubbing
arxiv.org·6d
🎧Vorbis Encoding