Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Back to article
Attention is all you need (2017)
(opens in new tab)
37
articles covering this post
arxiv.org
·
60w
60 weeks ago
·
DEV
,
Hacker News
,
Hacker News
·
Open original
(opens in new tab)
Save
Love
Like
Dislike
|
Add interest
Feeds
Share
|
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block
Add interest
Show Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Covered in 37 articles
How to Tame AI’s Voracious Appetite for Energy
nautil.us
·
3w
3 weeks ago
·
Hacker News
Actions for How to Tame AI’s Voracious Appetite for Energy
Emerging Patterns in Building GenAI Products
martinfowler.com
·
3w
3 weeks ago
·
Hacker News
Actions for Emerging Patterns in Building GenAI Products
Vision LLMs are PDF Parsers Too: Reading Charts and Diagrams for RAG
towardsdatascience.com
·
4d
4 days ago
Actions for Vision LLMs are PDF Parsers Too: Reading Charts and Diagrams for RAG
Parse PDFs for RAG Locally with Docling: Rich Tables, No Cloud Upload
towardsdatascience.com
·
5d
5 days ago
Actions for Parse PDFs for RAG Locally with Docling: Rich Tables, No Cloud Upload
Stop Returning Flat Text from a PDF: The Relational Shape RAG Needs
towardsdatascience.com
·
1w
1 week ago
Actions for Stop Returning Flat Text from a PDF: The Relational Shape RAG Needs
Beyond extract_text: The Two Layers of a PDF That Drive RAG Quality
towardsdatascience.com
·
1w
1 week ago
Actions for Beyond extract_text: The Two Layers of a PDF That Drive RAG Quality
RAG Is Not Machine Learning, and the ML Toolkit Solves the Wrong Problem
towardsdatascience.com
·
2w
2 weeks ago
Actions for RAG Is Not Machine Learning, and the ML Toolkit Solves the Wrong Problem
Embeddings Aren’t Magic: The Predictable Failure Modes of RAG Retrieval
towardsdatascience.com
·
2w
2 weeks ago
Actions for Embeddings Aren’t Magic: The Predictable Failure Modes of RAG Retrieval
Baseline Enterprise RAG, From PDF to Highlighted Answer
towardsdatascience.com
·
2w
2 weeks ago
Actions for Baseline Enterprise RAG, From PDF to Highlighted Answer
Unpacking AI: The Hardware Behind AI
pathtostaff.com
·
1w
1 week ago
·
Hacker News
Actions for Unpacking AI: The Hardware Behind AI
5 Fun Papers That Explain LLMs Clearly
kdnuggets.com
·
2w
2 weeks ago
Actions for 5 Fun Papers That Explain LLMs Clearly
AI Coding Tip 024 - Force a Criteria Check Before the Task Ends
dev.to
·
2d
2 days ago
·
DEV
Actions for AI Coding Tip 024 - Force a Criteria Check Before the Task Ends
91. The Transformer Architecture: The Invention That Changed AI
dev.to
·
4w
4 weeks ago
·
DEV
Actions for 91. The Transformer Architecture: The Invention That Changed AI
The usual implementaiton of attention transformers (SDPA) is kind of bad, actually
gist.github.com
·
4w
4 weeks ago
·
Hacker News
Actions for The usual implementaiton of attention transformers (SDPA) is kind of bad, actually
How LLMs Work, Part 1: How LLMs Process Text
shbhmrzd.github.io
·
3w
3 weeks ago
·
r/programming
Actions for How LLMs Work, Part 1: How LLMs Process Text
FareedKhan-dev/train-llm-from-scratch: A straightforward method for training your LLM, from downloading data to generating text.
github.com
·
2w
2 weeks ago
Actions for FareedKhan-dev/train-llm-from-scratch: A straightforward method for training your LLM, from downloading data to generating text.
wisnunugroho21/nugie-jax-nemotron: A simple, minimalistic, and explainable code implementation of of Nemotron 3 Nano in JAX
github.com
·
5w
5 weeks ago
·
r/learnmachinelearning
Actions for wisnunugroho21/nugie-jax-nemotron: A simple, minimalistic, and explainable code implementation of of Nemotron 3 Nano in JAX
Language Models Struggle to Keep a Secret
unite.ai
·
4w
4 weeks ago
Actions for Language Models Struggle to Keep a Secret
Current AI Model Inadequacies: Implications for the Global South
orfonline.org
·
4w
4 weeks ago
Actions for Current AI Model Inadequacies: Implications for the Global South
AI Paper Review: Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
freecodecamp.org
·
2d
2 days ago
Actions for AI Paper Review: Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Evaluating the role of pretraining dataset size and diversity on single-cell foundation model performance
nature.com
·
1w
1 week ago
Actions for Evaluating the role of pretraining dataset size and diversity on single-cell foundation model performance
The AI Consciousness Debate Is Happening at the Wrong Level
recursiveintelligence.io
·
3w
3 weeks ago
·
r/cogsci
,
r/neurophilosophy
Actions for The AI Consciousness Debate Is Happening at the Wrong Level
A deep dive into the Transformer architecture
blog.algomaster.io
·
5w
5 weeks ago
Actions for A deep dive into the Transformer architecture
The Memo - 13/Jun/2026
lifearchitect.substack.com
·
5d
5 days ago
·
Substack
Actions for The Memo - 13/Jun/2026
The Anatomy of an LLM | Interactive Visual Guide to How Language Models Work
royvanrijn.com
·
3w
3 weeks ago
·
Hacker News
Actions for The Anatomy of an LLM | Interactive Visual Guide to How Language Models Work
How LLMs Actually Work: A Friendly Map for Humans • oreoro
oreoro.github.io
·
1w
1 week ago
·
Hacker News
Actions for How LLMs Actually Work: A Friendly Map for Humans • oreoro
Understanding KV Cache: The Hidden Memory Cost of Serving LLMs
melchi.me
·
4w
4 weeks ago
·
Hacker News
Actions for Understanding KV Cache: The Hidden Memory Cost of Serving LLMs
Sebastian Mallaby, Biographer of Demis Hassabis — Lessons from 100+ AI Insiders on The Race to Superintelligence, The Religion of AI, and Spotting Breakthroughs...
tim.blog
·
1d
1 day ago
Actions for Sebastian Mallaby, Biographer of Demis Hassabis — Lessons from 100+ AI Insiders on The Race to Superintelligence, The Religion of AI, and Spotting Breakthroughs...
Thread by @KyeGomezB on Thread Reader App
threadreaderapp.com
·
4w
4 weeks ago
Actions for Thread by @KyeGomezB on Thread Reader App
How LLM Inference Works
arpitbhayani.me
·
5w
5 weeks ago
·
Hacker News
Actions for How LLM Inference Works
AI 101: Your Ultimate Guide to Attention: Mechanism, QKV, and KV Cache
turingpost.com
·
5w
5 weeks ago
Actions for AI 101: Your Ultimate Guide to Attention: Mechanism, QKV, and KV Cache
Inside the Transformer: The Life of a Token
aleksagordic.com
·
3w
3 weeks ago
·
Hacker News
Actions for Inside the Transformer: The Life of a Token
Self Attention
byhand.ai
·
3w
3 weeks ago
Actions for Self Attention
Give your agents disposable environments in Go
tigrisdata.com
·
3w
3 weeks ago
Actions for Give your agents disposable environments in Go
"Agentic AI" Is a Bonfire of the Tokens While Fab Capacity, Power Grids, and P&Ls Are the brakes: (NOT THE) READ OF THE DAY
braddelong.substack.com
·
2w
2 weeks ago
·
Substack
Actions for "Agentic AI" Is a Bonfire of the Tokens While Fab Capacity, Power Grids, and P&Ls Are the brakes: (NOT THE) READ OF THE DAY
In other languages
有人在拆 Transformer:Memory Caching 與 CTM 各拆走了一半
dev.to
·
1w
1 week ago
·
DEV
Actions for 有人在拆 Transformer:Memory Caching 與 CTM 各拆走了一半
Aandacht is alles wat je nodig hebt
janvandenberg.blog
·
2w
2 weeks ago
Actions for Aandacht is alles wat je nodig hebt
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Dislike
Report