Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Transformers
🤖 Transformers
Specific
Attention Mechanism, BERT, GPT Architecture, Neural Networks
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
161
posts in
5.4
ms
What an LLM Actually Does With Your Prompt First
✍️
Prompt Engineering
siliconopera.com
·
6d
6 days ago
Actions for What an LLM Actually Does With Your Prompt First
I stopped using most of Rust’s advanced features for my ML library
🤖
AI
Content type:
Code
github.com
·
2d
2 days ago
·
r/rust
Actions for I stopped using most of Rust’s advanced features for my ML library
Wall
Attention
: Length Generalization With Diagonal Gates | Tilde
🪟
Context Windows
Content type:
Blog
blog.tilderesearch.com
·
23h
23 hours ago
Actions for Wall Attention: Length Generalization With Diagonal Gates | Tilde
Tokenminning: Because Tokenmaxxing Is a Bad Idea
✍️
Prompt Engineering
tokenminning.com
·
1d
1 day ago
·
Hacker News
Actions for Tokenminning: Because Tokenmaxxing Is a Bad Idea
PENet+: A Lightweight Residual
Transformer
Framework for Efficient Image Steganalysis
⚡
Inference Optimization
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for PENet+: A Lightweight Residual Transformer Framework for Efficient Image Steganalysis
Using local LLMs for agentic coding
🦙
Llama
Content type:
Blog
blog.alexewerlof.com
·
6d
6 days ago
Actions for Using local LLMs for agentic coding
SLUUG Talk: Demystifying
Large
Language
Models
on Linux
🤖
AI
Content type:
Code
github.com
·
4d
4 days ago
·
DEV
Actions for SLUUG Talk: Demystifying Large Language Models on Linux
Look Less, Reason More: Block-wise
Attention
Skipping for Efficient
Multimodal
LLMs
🤖
LLM
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Look Less, Reason More: Block-wise Attention Skipping for Efficient Multimodal LLMs
The Sequence Radar #873: Last Week in AI: Soccer, S-1s, and Supermodels
🤖
Agent
Content type:
News
Content type:
Blog
thesequence.substack.com
·
3d
3 days ago
·
Substack
Actions for The Sequence Radar #873: Last Week in AI: Soccer, S-1s, and Supermodels
Train your own
GPT-2
(124M).
🐍
Python
Content type:
Blog
medium.com
·
5d
5 days ago
Actions for Train your own GPT-2 (124M).
Chiaroscuro
Attention
: Spending Compute in the Dark
🪟
Context Windows
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Chiaroscuro Attention: Spending Compute in the Dark
google/gemma-4-12B-it-qat-q4_0-gguf
⚡
Inference Optimization
huggingface.co
·
5d
5 days ago
Actions for google/gemma-4-12B-it-qat-q4_0-gguf
Anthropic: Claude Now Writes 80% of Its Own Code in 2026
🎭
Anthropic Claude
Content type:
Blog
wowhow.cloud
·
2d
2 days ago
·
DEV
Actions for Anthropic: Claude Now Writes 80% of Its Own Code in 2026
Handshake: Partner-Specific Protein-Protein Binding Site Prediction at Scale Using ProstT5 and Cross-Chain
Attention
🎯
Fine-tuning
Content type:
Academic
biorxiv.org
·
4d
4 days ago
Actions for Handshake: Partner-Specific Protein-Protein Binding Site Prediction at Scale Using ProstT5 and Cross-Chain Attention
Beyond Patches: Superpixel Token-based
Transformers
for Attribute-Specific Fashion Retrieval
🔍
RAG
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Beyond Patches: Superpixel Token-based Transformers for Attribute-Specific Fashion Retrieval
How Will the
Multimodal
AI Market Grow Through 2034 Amid Emerging Trends and Business Strategies?
🧠
OpenAI
Content type:
Blog
semiconinsights.wordpress.com
·
5d
5 days ago
Actions for How Will the Multimodal AI Market Grow Through 2034 Amid Emerging Trends and Business Strategies?
TextEconomizer: Enhancing Lossy Text Compression with Denoising
Transformers
and Entropy Coding
⚡
Inference Optimization
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for TextEconomizer: Enhancing Lossy Text Compression with Denoising Transformers and Entropy Coding
See, Act, Correct: three levers for working with a code agent
🎮
Reinforcement Learning
Content type:
Blog
blog.owulveryck.info
·
6d
6 days ago
·
Hacker News
,
Hacker News
Actions for See, Act, Correct: three levers for working with a code agent
We Taught a
Model
to Speak Legalese. Here’s What Changed.
🧠
OpenAI
Content type:
Blog
medium.com
·
5d
5 days ago
Actions for We Taught a Model to Speak Legalese. Here’s What Changed.
Introducing the Third Generation of Apple’s Foundation
Models
🤖
LLM
machinelearning.apple.com
·
3d
3 days ago
·
Hacker News
,
r/apple
Actions for Introducing the Third Generation of Apple’s Foundation Models
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help