Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Transformers
🤖 Transformers
Specific
Attention Mechanism, BERT, GPT, Sequence Modeling, Self-Attention
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
199
posts in
7.8
ms
AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence
🤖
LLMs
techradar.com
·
6d
6 days ago
Actions for AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence
Less-relevant results
defai-digital/ax-engine: Apple Silicon
LLM
runtime supporting Gemma 4 and Qwen 3.6 MTP
modes
🤖
LLMs
Content type:
Code
github.com
·
22h
22 hours ago
·
Hacker News
Actions for defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes
Guardian Angels:
LLM
Personalization for Productivity and Security
🔧
Developer Tools
gwern.net
·
3d
3 days ago
·
Hacker News
Actions for Guardian Angels: LLM Personalization for Productivity and Security
The Edge
LLM
Offload Story
🤖
AI
semiengineering.com
·
6d
6 days ago
Actions for The Edge LLM Offload Story
Towards Tight Bounds for Streaming
Attention
🧠
Deep Learning
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Towards Tight Bounds for Streaming Attention
Hugging
Face
Transformers
RCE flaw enables stealthy compromise via AI
model
configs
🔄
DevOps
csoonline.com
·
6d
6 days ago
Actions for Hugging Face Transformers RCE flaw enables stealthy compromise via AI model configs
What an
LLM
Actually Does With Your Prompt First
🤖
LLMs
siliconopera.com
·
5d
5 days ago
Actions for What an LLM Actually Does With Your Prompt First
Introducing Granite Libraries and Project Granite Switch
🤖
LLMs
Content type:
Blog
research.ibm.com
·
6d
6 days ago
·
Hacker News
Actions for Introducing Granite Libraries and Project Granite Switch
NVIDIA releases Nemotron 3 Ultra, claiming five times the speed and 30 percent lower costs than prior modelsThe
model
delivers 300 tokens per second on benchmar...
📝
NLP
digg.com
·
6d
6 days ago
Actions for NVIDIA releases Nemotron 3 Ultra, claiming five times the speed and 30 percent lower costs than prior modelsThe model delivers 300 tokens per second on benchmar...
Contribution Weights: A Geometrical Analysis of
Self-Attention
Transformers
🤖
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Contribution Weights: A Geometrical Analysis of Self-Attention Transformers
You’ve Been Using AI for Years. You Just Didn’t Call It That.
🤖
LLMs
Content type:
Blog
medium.com
·
5d
5 days ago
Actions for You’ve Been Using AI for Years. You Just Didn’t Call It That.
Issue #390 - The ML Engineer 🤖
🤖
Machine Learning
Content type:
News
Content type:
Blog
machinelearning.substack.com
·
3d
3 days ago
·
Substack
Actions for Issue #390 - The ML Engineer 🤖
RightNow-AI/AutoMegaKernel: An agent harness that compiles a
model
into one provably-correct,
self-retargeting
CUDA megakernel and
self-tunes
it past cuBLAS at batch-1
LLM
decode.
🤖
LLMs
Content type:
Code
github.com
·
2d
2 days ago
·
Hacker News
Actions for RightNow-AI/AutoMegaKernel: An agent harness that compiles a model into one provably-correct, self-retargeting CUDA megakernel and self-tunes it past cuBLAS at batch-1 LLM decode.
DeepSeek V4, LeCun's Bet Against LLMs, and Lovable's
Self-Improving
Agent - The Tokenizer Edition #30
🤖
Machine Learning
newsletter.artofsaience.com
·
6d
6 days ago
Actions for DeepSeek V4, LeCun's Bet Against LLMs, and Lovable's Self-Improving Agent - The Tokenizer Edition #30
Building Semantic Search with
Transformers.js
and Sentence Embeddings
🤖
AI
machinelearningmastery.com
·
5d
5 days ago
Actions for Building Semantic Search with Transformers.js and Sentence Embeddings
Chiaroscuro
Attention
: Spending Compute in the Dark
📈
Optimization
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Chiaroscuro Attention: Spending Compute in the Dark
What Does Abliteration Actually Cost?
📝
NLP
lesswrong.com
·
5d
5 days ago
Actions for What Does Abliteration Actually Cost?
My research agenda and work
🤖
LLMs
lesswrong.com
·
5d
5 days ago
Actions for My research agenda and work
nex-agi/Nex-N2-mini •
Huggingface
📝
NLP
huggingface.co
·
6d
6 days ago
·
r/LocalLLaMA
Actions for nex-agi/Nex-N2-mini • Huggingface
BioMedGraphica: An All-in-One Platform for Joint Textual Biomedical Prior Knowledge and Numeric Graph Generation
🗂️
Data Structures
academic.oup.com
·
5d
5 days ago
Actions for BioMedGraphica: An All-in-One Platform for Joint Textual Biomedical Prior Knowledge and Numeric Graph Generation
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help