Transformer Models
Specific
Attention Mechanism, BERT, GPT Architecture, Self-Attention
Scoured 503 posts in 7.5 ms
google/gemma-4-12B-it-qat-q4_0-gguf
🤖
llm
huggingface.co
·
5d
SPADE: Split-and-Delay Embeddings for Autoregressive High-Granularity Calorimeter Simulation
🗄️
Vector Databases
Content type:
Academic
arxiv.org
·
10h
Your Hong Kong weekend food guide for June 12-14
🤖
llm
Content type:
News
scmp.com
·
1d
·
r/SCMPauto
Context windows in AI: why every token is a budget decision
🤖
llm
Content type:
Blog
redis.io
·
20h
Apples to Apples: MLX vs. Llama.cpp for Gemma 4 12B on an M1 16GB
🤖
llm
Content type:
Blog
ziraph.com
·
5d
·
Hacker News
146th airhacks tv: Rust, Java 25, AI Agents, BCE, Web Components, zunit, zb
🤖
llm
Content type:
Blog
adambien.blog
·
1d
Apple WWDC On-Device AI Deep Dive - Google Docs
🤖
llm
gist.is
·
16h
·
Hacker News
markusheimerl/gpt: A generative pretrained transformer implementation
🤖
llm
Content type:
Code
github.com
·
5d
·
Hacker News
What shapes your power bill? Explainable AI outlines forecasts behind grid and price decisions
🤖
AI coding
techxplore.com
·
2d
gist:5b74b8c31e934ff50ce57aa653a343d5
🤖
llm
gist.github.com
·
11h
·
r/LocalLLaMA
BeeLlama.cpp DFlash on Strix Halo: 2.7x Gemma 31B, But MTP Is Still Faster
🤖
llm
sleepingrobots.com
·
4d
[NEW MODEL] SupraLabs just released Supra1.5-50M Base (Experimental)!
🤖
llm
huggingface.co
·
2h
·
r/LocalLLaMA
NBC Nightly News With Tom Llamas : KNTV : June 4, 2026 4:00pm-4:30pm PDT : Free Borrow & Streaming
🤖
llm
Content type:
Video
archive.org
·
6d
Tech jobs: Specialist subject
🤖
AI coding
emerging-europe.com
·
9h
A machine-learning-based reconstruction of surface mass balance over the Greenland Ice Sheet from 1950 to 2020
🤖
AI coding
Content type:
Academic
nature.com
·
2d
Claude Now Writes 80% of Its Own Code — Anthropic's Self-Improvement Milestone Arrives Faster Than Expected
🤖
AI coding
the-agent-report.com
·
2d
·
DEV
The biggest local LLM on your machine is useless if it can't call a single tool, no matter how many parameters it has
🤖
llm
xda-developers.com
·
21h
Improved performance and model support with GGUF
🤖
llm
Content type:
Blog
ollama.com
·
6d
Reachability and asymptotics of Gaussian Transformer dynamics
🤖
llm
Content type:
Academic
arxiv.org
·
2d
"AI" Is Eating Platform Monopolist Free Cash Flow, Not the World: CHART OF THE DAY
🤖
llm
Content type:
News, Blog
braddelong.substack.com
·
2d
·
Substack