DeepSeek-OCR + LLama4 + RAG Just Revolutionized Agent OCR Forever
dev.to·1h
🤖Advanced OCR
Mubeen AI: A Specialized Arabic Language Model for Heritage Preservation and User Intent Understanding
arxiv.org·1d
🤖Paleographic ML
Beyond the Magic: How LLMs Work
tag1.com·22h
💻Local LLMs
Wednesday 26 November - 11am
informatics.ed.ac.uk·23h
🎵Audio ML
New Dataset PerSense-D Enables Model-Agnostic Dense Object Segmentation
hackernoon.com·13h
📊Learned Metrics
Everything You Need to Know About Character AI
news.ycombinator.com·20h
🎙️Whisper
Algorithmic Bias Mitigation via Contrastive Fairness Learning with Adaptive Data Augmentation
dev.to·3h
📊Learned Metrics
CURVETE: Curriculum Learning and Progressive Self-supervised Training for Medical Image Classification
arxiv.org·1d
🌀Differential Geometry
Sparse Adaptive Attention “MoE”: How I Solved OpenAI’s $650B Problem With a £700 GPU
medium.com·17h
🌊Streaming Algorithms
Amazon Nova Multimodal Embeddings: State-of-the-art embedding model for agentic RAG and semantic search
aws.amazon.com·17h
🧮Vector Embeddings
Advancing cybersecurity: a comprehensive review of AI-driven detection techniques
journalofbigdata.springeropen.com·6h
🎯Threat Hunting
Do We Still Need OCR?
pageindex.ai·1d
🤖Advanced OCR
ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping
dev.to·15h
Proof Automation
Quantifying the Effects of Word Length, Frequency, and Predictability on Dyslexia
arxiv.org·4h
🧠Intelligence Compression
DeepSeek-OCR: Images Simplify Text for Large Language Models
heise.de·4d
🤖Advanced OCR
Vision-Driven OCR for Long Documents: How Images Compress Text for LLMs
dev.to·1d
🤖Advanced OCR
Long-Context Modeling with Dynamic Hierarchical Sparse Attention for On-Device LLMs
arxiv.org·4h
💻Local LLMs
Abjad AI at NADI 2025: CATT-Whisper: Multimodal Diacritic Restoration Using Text and Speech Representations
arxiv.org·4h
🎙️Whisper
Convert any GitHub repo to coding puzzles
github.com·14h
Proof Automation