Scour
Transformers
Specific
Attention Mechanism, Self-Attention, BERT, Architecture
Scoured 181569 posts in 35.3 ms
Transformers in NLP: How Self-Attention Replaced Recurrence and Changed Everything
LLMs · medium.com · 1d
Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis
Machine Learning · proceedings.neurips.cc · 9h
Nexusformer: Nonlinear Attention Expansion for Stable and Inheritable Transformer Scaling
LLM · arxiv.org · 3d
Transformers
LLMs · chizkidd.github.io · 2d · Hacker News
The Second Half of Model Architecture
AI Reasoning · lh-zhu.github.io · 5d
Watch language models think.
LLMs · openinterp.org · 1d · Hacker News
What is an LLM? A Guide on Large Language Models and How They Work
LLM · datacamp.com · 2d
Transformers are Just an Expensive While Loop
LLM · medium.com · 5d
BranchyNet: Teaching Neural Networks When to Stop Thinking
AI Reasoning · medium.com · 1d
Are AI Companions Real Companions? A BERT-Based Study of Replika Reviews
AI · onlinelibrary.wiley.com · 4d
Machine learning-driven alignment architecture of heterogeneous data with transient varying semantics
Machine Learning · nature.com · 2d
The Sequence Knowledge #846: Beyond Transformer: A New Series
LLMs · substackcdn.com · 4d · Substack
kyegomez/OpenMythos: A theoretical reconstruction of the Claude Mythos architecture, built from first principles using the available research literature.
LLM · github.com · 5d · Hacker News
My 12-Week Plan to Actually Understand AI
AI Reasoning · shobhitshri.substack.com · 5d · Substack
Cognitive Alignment At No Cost: Inducing Human Attention Biases For Interpretable Vision Transformers
Machine Learning · arxiv.org · 2d
Building a centralized AI tool aggregator: architecture, API normalization, and latency tradeoffs
AI Reasoning · onlyaitools.lovable.app · 6d · DEV
How AI Is Evolving: From Large Language Models to Agentic Intelligence
AI Reasoning · en.tempo.co · 4d
Transformer vs CNN-LSTM: CWRU Bearing 96% vs 92% Accuracy
LLM · tildalice.io · 4d
The Recurrent Transformer: Greater Effective Depth and Efficient Decoding
LLM · arxiv.org · 1d
Emergence Transformer: Dynamical Temporal Attention Matters
LLMs · arxiv.org · 2d