🔄 Transformers - moyutianzun · Scour

AI Chart Understanding Breakthrough: MIT-IBM Dataset Lets Small Models Beat GPT-4o

📊LLM Evaluation

techtimes.com·

Less-relevant results

Pathetic pretense

🔍RAG Blog

freethoughtblogs.com·

Google's new open model DiffusionGemma generates text from noise instead of word by word

the-decoder.com

·

Why LLMs hallucinate?

📊LLM Evaluation Blog

·

What the ocean taught me about AI.

🤖agentic system Blog

Surprise upset: GPT-5.5 beats Claude Fable 5 on brutal new Agents’ Last Exam benchmark

🤖agentic system

venturebeat.com·

Treble Technologies and Hugging Face Address Benchmark of Automatic Speech Recognition Models

🎛️Fine-Tuning

audioxpress.com·

Kuramoto Attention: Synchronizing Self-Attention on the Torus

⚡FlashAttention Academic

Breaking tunnel vision, imaging AI lifts fluorescence image restoration accuracy and speed

⚡FlashAttention

OpenCV 5 Debuts with Improved ONNX Support and Native AI Upgrades

⚡FlashAttention News

Analyzing the geometric dependence of thermoelastic Q -factor in micro hemispherical resonators via a data-augmented CNN-transformer model

⚡Inference Optimization Academic

Google open-sources speedy DiffusionGemma text diffusion model

🎭Mixture of Experts

siliconangle.com·

Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information

venturebeat.com··Hacker News

AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence

📊LLM Evaluation

·

NVIDIA's Cosmos 3: The World's First Fully Open AI Omnimodel

🤖agentic system News

aimagazine.com·

mingusb/transformer-golf: The Fully Unrolled Transformer: An experimental repository for architecture simplification and compilation. [2026]

⚡FlashAttention Code

github.com··Hacker News

DiffusionGemma: 4x Faster Text Generation

🎭Mixture of Experts News Blog

blog.google··Hacker News, r/LocalLLaMA, r/singularity

NeuroBait: I fine-tuned a model to spark dopamine for ADHD brain

🎛️Fine-Tuning Blog

huggingface.co·

Microsoft Research's Lens proves detailed captions matter more than raw scale for training efficient image generators

⚙post training infra News

the-decoder.com

·

Apple WWDC On-Device AI Deep Dive - Google Docs

🎛️Fine-Tuning

gist.is··Hacker News

Log in to enable infinite scrolling