Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
👁️ Attention Optimization
Flash Attention, Memory Efficient, Sparse Attention, Transformers
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
122321
posts in
711.9
ms
Learning to
Remember
, Learn, and
Forget
in Attention-Based Models
arxiv.org
·
1d
🧩
Attention Kernels
LOTFormer
:
Doubly-Stochastic
Linear Attention via Low-Rank Optimal Transport
arxiv.org
·
2d
⚡
Flash Attention
Image Classification with
CNNs
– Part 3: Understanding Max
Pooling
and Results
dev.to
·
28m
·
Discuss:
DEV
🧮
cuDNN
Google
Deepmind
upgrades
Gemini 3 Deep Think for complex science and engineering tasks
the-decoder.com
·
3h
🎯
Tensor Cores
Digitizing
the "
Shokunin
": How we encoded a Master's hammer strike into AI
yusukekaizen.substack.com
·
14h
·
Discuss:
Substack
📉
Model Quantization
New Generative
Paradigm
:
Drifting
Model
mail.bycloud.ai
·
2d
📊
Gradient Accumulation
My Go-To AI Tools: February 2026 Update
whytryai.com
·
10h
🤖
AI Coding Tools
Show HN: The
Algorithm
's Favorite Child
chatbotkit.com
·
5h
·
Discuss:
Hacker News
⚡
ONNX Runtime
Quality and
understandability
after AI
federicopereiro.com
·
10h
·
Discuss:
Hacker News
🤖
AI Coding Tools
The 4 Mixture of Experts Architectures: How to Train
100B
Models at
10B
Cost
pub.towardsai.net
·
7h
🎓
Model Distillation
Transform
your look with the power of AI
tryaibeauty.com
·
18h
·
Discuss:
Hacker News
🧩
Attention Kernels
Moltis
: Rust based AI assistant with memory, tools, and
self-extending
skills
moltis.org
·
1h
·
Discuss:
Hacker News
🤖
AI Coding Tools
UbiquitousLearning/mllm
: Fast Multimodal LLM on Mobile Devices
github.com
·
11h
🏎️
TensorRT
Cognitive
Training
Platforms
trendhunter.com
·
1d
🧠
BF16
A C implementation of the inference pipeline for the Mistral AI’s
Voxtral
Realtime
4B model
blog.adafruit.com
·
4h
🏎️
TensorRT
Cognitive
Illusion
: Why AI Still Can’t Think Like a Human
neurosciencenews.com
·
12m
🧩
Attention Kernels
How
Andrej
Karpathy
Built a Working Transformer in 243 Lines of Code
analyticsvidhya.com
·
7h
📜
TorchScript
Transformer-Based Memory Forecasting: Leveraging
Anonymized
Aggregates
for Personal Insights
novice.media
·
23h
·
Discuss:
Hacker News
⚡
Flash Attention
Show HN:
ProductFront-Streamlined
product discovery platform for maximum exposure
productfront.tech
·
15h
·
Discuss:
Hacker News
🤖
AI Coding Tools
Recursive
Language Models: Stop
Stuffing
the Context Window
nlp.elvissaravia.com
·
51m
⚡
ONNX Runtime
Loading...
Loading more...
« Page 1
•
Page 3 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help