Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Transformers
⚙️ Transformers
Specific
transformer architecture, attention mechanism, BERT, encoder
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
228
posts in
7.1
ms
markusheimerl/gpt
: A generative pretrained
transformer
implementation
✨
Generative AI
Content type:
Code
github.com
·
4d
4 days ago
·
Hacker News
Actions for markusheimerl/gpt: A generative pretrained transformer implementation
The Sequence Knowledge #874:
Transformers
or Not?
🧠
Machine Learning
substackcdn.com
·
23h
23 hours ago
·
Substack
Actions for The Sequence Knowledge #874: Transformers or Not?
Towards Tight Bounds for Streaming
Attention
🔍
RAG
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Towards Tight Bounds for Streaming Attention
Big Blue’s Redbook on Storage Scale
KV
Cache
management
📝
LLMs
Content type:
News
blocksandfiles.com
·
17h
17 hours ago
Actions for Big Blue’s Redbook on Storage Scale KV Cache management
The Memory Problem is Solved: How Google’s Memory
Caching
Makes RNNs Smart Again
📱
Edge AI
Content type:
Blog
medium.com
·
1d
1 day ago
Actions for The Memory Problem is Solved: How Google’s Memory Caching Makes RNNs Smart Again
How we fight GPU scarcity without compromise
🤖
AI
Content type:
Blog
equixly.com
·
5d
5 days ago
·
Hacker News
Actions for How we fight GPU scarcity without compromise
Less-relevant results
I Built a Collection of 100+ Free Developer Tools That Run Entirely in the Browser
📝
LLMs
solutiontoolkit.com
·
1d
1 day ago
·
DEV
Actions for I Built a Collection of 100+ Free Developer Tools That Run Entirely in the Browser
The
Transformer
, Demystified — Let's Actually Build One
🔗
Deep Learning
Content type:
News
mlwhiz.com
·
4d
4 days ago
Actions for The Transformer, Demystified — Let's Actually Build One
3-Part Series: LLM Latency in Production (Part 1)
🤖
AI
towardsai.net
·
6d
6 days ago
Actions for 3-Part Series: LLM Latency in Production (Part 1)
VelocityFM: Short-Horizon Protein Trajectory Prediction via Flow Matching in Velocity Space
🧬
Bioinformatics
Content type:
Academic
biorxiv.org
·
2d
2 days ago
Actions for VelocityFM: Short-Horizon Protein Trajectory Prediction via Flow Matching in Velocity Space
Hugging Face
Transformers
RCE
flaw
enables stealthy compromise via AI
model
configs
🛡️
AI Safety
csoonline.com
·
5d
5 days ago
Actions for Hugging Face Transformers RCE flaw enables stealthy compromise via AI model configs
Analyzing the geometric dependence of thermoelastic Q -factor in micro hemispherical resonators via a data-augmented
CNN-transformer
model
🧠
Machine Learning
Content type:
Academic
nature.com
·
5d
5 days ago
Actions for Analyzing the geometric dependence of thermoelastic Q -factor in micro hemispherical resonators via a data-augmented CNN-transformer model
PagedAttention vs Traditional
KV
Cache
: How vLLM Reinvented GPU Memory for LLM Inference
🤖
AI
Content type:
Blog
medium.com
·
1d
1 day ago
Actions for PagedAttention vs Traditional KV Cache: How vLLM Reinvented GPU Memory for LLM Inference
Gated DeltaNet, From First Principles
🔗
Deep Learning
Content type:
Blog
sankalp.bearblog.dev
·
17h
17 hours ago
Actions for Gated DeltaNet, From First Principles
How LLMs Actually Work: A Friendly Map for Humans • oreoro
📝
LLMs
oreoro.github.io
·
4d
4 days ago
·
Hacker News
Actions for How LLMs Actually Work: A Friendly Map for Humans • oreoro
What the ocean taught me about AI.
✨
Generative AI
Content type:
Blog
medium.com
·
1d
1 day ago
Actions for What the ocean taught me about AI.
linzhiqiu/t2v_metrics: Evaluating text-to-image/video/3D
models
with VQAScore
🎨
Diffusion Models
Content type:
Code
github.com
·
17h
17 hours ago
·
Hacker News
Actions for linzhiqiu/t2v_metrics: Evaluating text-to-image/video/3D models with VQAScore
Beyond Patches: Superpixel Token-based
Transformers
for Attribute-Specific Fashion Retrieval
🔍
RAG
Content type:
Academic
arxiv.org
·
6h
6 hours ago
Actions for Beyond Patches: Superpixel Token-based Transformers for Attribute-Specific Fashion Retrieval
Breaking tunnel vision, imaging AI lifts fluorescence image restoration accuracy and speed
🧠
Machine Learning
phys.org
·
1d
1 day ago
Actions for Breaking tunnel vision, imaging AI lifts fluorescence image restoration accuracy and speed
AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence
✨
Generative AI
techradar.com
·
5d
5 days ago
Actions for AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help