Transformers

Feeds to Scour
SubscribedAll
Scoured 317 posts in 8.0 ms

AI Chart Understanding Breakthrough: MIT-IBM Dataset Lets Small Models Beat GPT-4o

 📊LLM Evaluation
techtimes.com·
Less-relevant results

Pathetic pretense

 🔍RAG  Content type: Blog

Google's new open model DiffusionGemma generates text from noise instead of word by word

 💾KV Cache
the-decoder.com
·

Why LLMs hallucinate?

 📊LLM Evaluation  Content type: Blog
medium.com
·

What the ocean taught me about AI.

 🤖agentic system  Content type: Blog
medium.com·

Surprise upset: GPT-5.5 beats Claude Fable 5 on brutal new Agents’ Last Exam benchmark

 🤖agentic system
venturebeat.com·

Treble Technologies and Hugging Face Address Benchmark of Automatic Speech Recognition Models

 🎛️Fine-Tuning
audioxpress.com·

Kuramoto Attention: Synchronizing Self-Attention on the Torus

 FlashAttention  Content type: Academic
arxiv.org·

Breaking tunnel vision, imaging AI lifts fluorescence image restoration accuracy and speed

 FlashAttention
phys.org·

OpenCV 5 Debuts with Improved ONNX Support and Native AI Upgrades

 FlashAttention  Content type: News
hackster.io·

Analyzing the geometric dependence of thermoelastic Q -factor in micro hemispherical resonators via a data-augmented CNN-transformer model

 Inference Optimization  Content type: Academic
nature.com·

Google open-sources speedy DiffusionGemma text diffusion model

 🎭Mixture of Experts
siliconangle.com·

Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information

 🔍RAG

AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence

 📊LLM Evaluation
techradar.com
·

NVIDIA's Cosmos 3: The World's First Fully Open AI Omnimodel

 🤖agentic system  Content type: News
aimagazine.com·

mingusb/transformer-golf: The Fully Unrolled Transformer: An experimental repository for architecture simplification and compilation. [2026]

 FlashAttention  Content type: Code
github.com··Hacker News

DiffusionGemma: 4x Faster Text Generation

 🎭Mixture of Experts  Content type: News  Content type: Blog

NeuroBait: I fine-tuned a model to spark dopamine for ADHD brain

 🎛️Fine-Tuning  Content type: Blog
huggingface.co·

Microsoft Research's Lens proves detailed captions matter more than raw scale for training efficient image generators

 post training infra  Content type: News
the-decoder.com
·

Apple WWDC On-Device AI Deep Dive - Google Docs

 🎛️Fine-Tuning
gist.is··Hacker News

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help