🤖 Transformers
Attention Mechanism, Self-Attention, BERT, Architecture
Affine-Scaled Attention: Towards Flexible and Stable Transformer Attention
arxiv.org · 1d
💬 LLMs
Semantic search and retrieval using transformers
thoughtworks.com · 23h
💬 LLMs
Deep Dive into Transformer Encoders by Hand ✍️
pub.towardsai.net · 17h
🔥 PyTorch
On language models and intuition
aleksei.dev · 2h
💬 LLMs
The Day an AI Said "Left Brain" — Dependent Origination in Transformer Self-Description
zenodo.org · 12h · Discuss: DEV
💬 LLMs
MetaOthello: A Controlled Study of Multiple World Models in Transformers
arxiv.org · 1d
🧠 LLM
Full Transformer Block — Deep Dive + Problem: List Operations
dev.to · 6h · Discuss: DEV
🧠 LLM
Attention-related modulation in the superior colliculus encodes perceptual sensitivity, but not perceptual choice
nature.com · 31m
📡 Information Theory
douglas-larocca/name-classifier: A high-performance name classifier that infers probabilistic attributes about a person from their name alone.
github.com · 3h · Discuss: Hacker News
💬 LLMs
Transformers Have Computational Signatures Orthogonal to Semantic Content
lesswrong.com · 2d
🧠 LLM
Units of attention
thoughtshrapnel.com · 2d
📡 Information Theory
Microsoft’s Graphormer: The Transformer That Finally Beats GNNs
hackernoon.com · 6h
💬 LLMs
Evaluating LLMs using semantic entropy
research.thoughtworks.com · 23h
💬 LLMs
How Attention, Context and Routing Shape Modern AI Models (A Systems Deep Dive)
dev.to · 1d · Discuss: DEV
💬 LLMs
Unifying Arabic topolects through AI
languagelog.ldc.upenn.edu · 57m
🤖 AI
Why I Built a Masked Autoencoder (MAE) from Scratch (And How You Can Too)
pub.towardsai.net · 1d
🔥 PyTorch
AI Learns To Self-Correct And Reduce False Claims Using Internal Knowledge
quantumzeitgeist.com · 2d
💬 LLMs
Asura: Looped Language Models done better
neel04.github.io · 2d · Discuss: Hacker News
🧠 LLM
Attention BAs: All about Feature Injection
thoughtworks.com · 23h
🤖 AI
How brain networks work together is key to human intelligence
futurity.org · 3h
📡 Information Theory