Scour
🤖 Transformers
Specific: Attention Mechanism, BERT, GPT, Language Models
Scoured 561 posts in 12.8 ms
markusheimerl/gpt: A generative pretrained transformer implementation
📊 Optimization · Content type: Code · github.com · 5 days ago · Hacker News
Google open-sources speedy DiffusionGemma text diffusion model
📝 NLP · siliconangle.com · 15 hours ago
LLM-Based Code Documentation Generation and Multi-Judge Evaluation
💬 Prompt Engineering · Content type: Academic · arxiv.org · 1 day ago
high-performance classification API (beats GPT-5.4-mini)
💬 Prompt Engineering · Content type: Discussion · classer.ai · 3 hours ago · Hacker News
DiffusionGemma 26B A4B results on my 5090
💬 Prompt Engineering · huggingface.co · 1 day ago · r/LocalLLaMA
The Transformer Architecture: A Step-by-Step Guide
🎭 Anthropic Claude · Content type: Blog · m7mdelyoussef.medium.com · 19 hours ago
local llm on laptop 780M GPU using llama + gemma 4 qat
🦙 Ollama · Content type: Blog · alper.bearblog.dev · 5 days ago
New comment by alroma90 in "Ask HN: Who wants to be hired? (June 2026)"
💬 Prompt Engineering · Content type: Discussion · news.ycombinator.com · 5 hours ago · Hacker News
LeLab Is Hugging Face’s New Browser-Based GUI for the LeRobot Ecosystem
📝 NLP · Content type: News · hackster.io · 2 days ago
Making a Vintage LLM from Scratch
🤖 n8n, automation, AI agents, Gemini, Claude, openrouter, grok, chatgpt · crlf.link · 7 hours ago · Hacker News
Your LLM Isn’t Reading Your Manners — It’s Counting Your Tokens
💬 Prompt Engineering · Content type: Blog · medium.com · 13 hours ago
Attention Based Interpretability With Concept Transformer
🧮 Embeddings · Content type: Blog · medium.com · 6 days ago
ELI5 is a terrible learning prompt, here's the structural reason it fails and a 4-level replacement that actually sticks
💬 Prompt Engineering · Content type: Blog, Tutorial · appliedaihub.org · 1 day ago · r/PromptEngineering
Dr. Ashish Bamania (@drashishbamania)
📝 NLP · substack.com · 5 hours ago · Substack
A deep learning framework for emotion recognition in music using multimodal data fusion
👁️ Computer Vision · Content type: Academic · nature.com · 16 hours ago
MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent
🦙 Ollama · Content type: Blog · bric.pe.kr · 2 days ago · DEV
How LLMs work | Practical Leaders
💬 Prompt Engineering · practical-leaders.com · 6 days ago · Hacker News
AI 101: From Prompt Engineering to Skill Engineering
💬 Prompt Engineering · turingpost.com · 19 hours ago
Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell
📊 Performance Tools · Content type: News, Blog · developer.nvidia.com · 2 days ago
Malicious Hugging Face Models Could Trigger Remote Code Execution
📝 NLP · techrepublic.com · 5 days ago