Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
👁️ Attention Optimization
Flash Attention, Memory Efficient, Sparse Attention, Transformers
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
82763
posts in
1.13
s
Attention
Optimization
aussieai.com
·
5d
🧩
Attention Kernels
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
HySparse
: A Hybrid Sparse Attention Architecture with Oracle Token Selection and
KV
Cache Sharing
arxiv.org
·
6h
🧩
Attention Kernels
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Neural Attention Search
Linear
: Towards Adaptive
Token-Level
Hybrid Attention Models
arxiv.org
·
6h
🧩
Attention Kernels
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
How
Transformers
Pay Attention Like
Humans
Do
dev.to
·
2d
·
Discuss:
DEV
🧩
Attention Kernels
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Physics 1 – Attention can’t exactly
simulate
uniform
linear motion
kindxiaoming.github.io
·
22h
🧩
Attention Kernels
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
How
Transformers
Architecture
Powers
Modern LLMs
blog.bytebytego.com
·
1d
🧩
Attention Kernels
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
**Abstract:** This paper introduces a novel approach to Neural Architecture Search (NAS) specifically
tailored
for
resource-constrained
edge AI vision system...
freederia.com
·
3h
⚡
ONNX Runtime
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Training Design for Text-to-Image Models:
Lessons
from
Ablations
huggingface.co
·
23h
📊
Gradient Accumulation
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Edit
Mind
: AI-Powered Local Video Search & Analysis
producthunt.com
·
13h
⚡
Flash Attention
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
The Proximity of the
Inception
Score as an Evaluation
Criterion
towardsdatascience.com
·
23h
📊
Gradient Accumulation
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
7 Advanced Feature Engineering
Tricks
Using LLM
Embeddings
machinelearningmastery.com
·
19h
📉
Model Quantization
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
A New AI Architecture Without Prior
Distributions
: Stream-Based AI and
Compositional
Inference
dev.to
·
1d
·
Discuss:
DEV
⚡
Flash Attention
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Shows
Convergence
Of Language, Vision And Action Representations At 0.73
Similarity
quantumzeitgeist.com
·
1h
🧩
Attention Kernels
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
slow
abstraction
steel-water.bearblog.dev
·
4h
🐕
Ruff
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Why Focus Still
Matters
in a
Distracted
World
talkflow.substack.com
·
18h
·
Discuss:
Substack
⚡
Flash Attention
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
The Core
Flaws
of Modern AI based on Large Language Models (
longpost
)
bykozy.me
·
21h
·
Discuss:
Hacker News
📊
Gradient Accumulation
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Using
Nsight
Compute with large
codebases
- Part 2 : Profiling large code bases
blog.ncompass.tech
·
18h
·
Discuss:
Hacker News
🔍
Nsight
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
New AI
Quiz
Generator
learvo.com
·
16h
·
Discuss:
Hacker News
🤖
AI Coding Tools
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Future
leakage
in
block-quantized
attention
matx.com
·
1d
·
Discuss:
Hacker News
📉
Model Quantization
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Beyond Two
Towers
:
Re-architecting
the Serving Stack for Next-Gen Ads Lightweight Ranking Models…
medium.com
·
1d
⚡
ONNX Runtime
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help