🤖 Transformers
Attention Mechanism, BERT, GPT, Language Models
Scoured 6933 posts in 423.5 ms
So what's the next word, then? Almost-no-math intro to transformer models
matthias-kainer.de · 1d · Discuss: Hacker News
💬 Natural Language Processing
Attention Retention for Continual Learning with Vision Transformers
arxiv.org · 1d
🧠 Deep Learning
In (highly contingent!) defense of interpretability-in-the-loop ML training
lesswrong.com · 19h
∂ Automatic Differentiation
A Neuro Symbolic Architecture For Induced Epistemic Agency and System 2 Reasoning in Quantized Large Language Models
papers.ssrn.com · 1d · Discuss: Hacker News
🗣️ Large Language Models
GABRIEL – turn messy qualitative corpora into analysis-ready datasets
github.com · 1d · Discuss: Hacker News
🗣️ Large Language Models
Sequential Attention: Making AI models leaner and faster without sacrificing accuracy
research.google · 2d · Discuss: Hacker News, r/LocalLLaMA
🧠 Deep Learning
Text classification with Python 3.14's zstd module • Max Halford
maxhalford.github.io · 1d · Discuss: Lobsters, Hacker News
🗣️ Large Language Models
Hypernetworks: Neural Networks for Hierarchical Data
blog.sturdystatistics.com · 1d · Discuss: Hacker News
∂ Automatic Differentiation
Turning Coding Tasks into Feedback Loops
feipeng.substack.com · 1d · Discuss: Substack
∂ Automatic Differentiation
Transformers Are Born Biased: Structural Inductive Biases at Random Initialization and Their Practical Consequences
arxiv.org · 1d
🗣️ Large Language Models
Understanding LLM Inference Engines: Inside Nano-vLLM (Part 2)
neutree.ai · 21h · Discuss: Hacker News
🗣️ Large Language Models
Continual learning and the post monolith AI era
baseten.co · 13h · Discuss: Hacker News
🧠 Deep Learning
The control layer for AI
blog.dottxt.ai · 11h · Discuss: Hacker News
🗣️ Large Language Models
ML-LIB: Machine Learning Library Proposed For The Linux Kernel
phoronix.com · 16h · Discuss: Hacker News
🗣️ Large Language Models
NVIDIA Transformer Engine
docs.nvidia.com · 1d · Discuss: Hacker News
🔥 PyTorch
Building Highly Efficient Inference System for Recommenders Using PyTorch
pytorch.org · 1d · Discuss: Hacker News
🔥 PyTorch
Training a Small Language Model
elijahpotter.dev · 4d · Discuss: Hacker News
🗣️ Large Language Models
Field theory and the rise of ambition machines
fieldtheory.dev · 17h · Discuss: Hacker News
💬 Natural Language Processing
Writing an LLM from scratch, part 32d -- Interventions: adding attention bias
gilesthomas.com · 11h · Discuss: Hacker News
∂ Automatic Differentiation
Agentic Productivity System with Plain Markdown
sattlerjoshua.com · 1d · Discuss: Hacker News
📈 Data Visualization