Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Transformers
🤖 Transformers
Specific
Attention Mechanism, BERT, GPT Architecture, Sequence Models
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
187
posts in
6.6
ms
markusheimerl/gpt
: A generative pretrained
transformer
implementation
📝
Natural Language Processing
Content type:
Code
github.com
·
6d
6 days ago
·
Hacker News
Actions for markusheimerl/gpt: A generative pretrained transformer implementation
know the mother tongue of your LLMs
🤖
LLM
mothertoken.inigoimaz.com
·
2d
2 days ago
·
Hacker News
Actions for know the mother tongue of your LLMs
SPADE: Split-and-Delay Embeddings for Autoregressive High-Granularity Calorimeter Simulation
📝
Natural Language Processing
Content type:
Academic
arxiv.org
·
19h
19 hours ago
Actions for SPADE: Split-and-Delay Embeddings for Autoregressive High-Granularity Calorimeter Simulation
OCOO-T : A SIMPLE AND SCALABLE VIRTUAL CELL
MODEL
FOR TRANSCRIPTIONAL PERTURBATION RESPONSE PREDICTION
🗄️
Vector Databases
Content type:
Academic
biorxiv.org
·
5h
5 hours ago
Actions for OCOO-T : A SIMPLE AND SCALABLE VIRTUAL CELL MODEL FOR TRANSCRIPTIONAL PERTURBATION RESPONSE PREDICTION
The
Sequence
Knowledge #874:
Transformers
or Not?
⛓️
LangChain
substackcdn.com
·
2d
2 days ago
·
Substack
Actions for The Sequence Knowledge #874: Transformers or Not?
Your
LLM
Isn’t Reading Your Manners — It’s Counting Your Tokens
🤖
LLM
Content type:
Blog
medium.com
·
20h
20 hours ago
Actions for Your LLM Isn’t Reading Your Manners — It’s Counting Your Tokens
How we fight GPU scarcity without compromise
📝
Natural Language Processing
Content type:
Blog
equixly.com
·
6d
6 days ago
·
Hacker News
Actions for How we fight GPU scarcity without compromise
ELI5 is a terrible learning prompt, here's the structural reason it fails and a 4-level replacement that actually sticks
🤖
AI
Content type:
Blog
Content type:
Tutorial
appliedaihub.org
·
1d
1 day ago
·
r/PromptEngineering
Actions for ELI5 is a terrible learning prompt, here's the structural reason it fails and a 4-level replacement that actually sticks
Two old GPUs I salvaged are doing more AI work than a brand new $2000 card, and I won't be upgrading anytime soon
📝
Natural Language Processing
xda-developers.com
·
7h
7 hours ago
Actions for Two old GPUs I salvaged are doing more AI work than a brand new $2000 card, and I won't be upgrading anytime soon
The
Transformer
, Demystified — Let's Actually Build One
🤖
AI
Content type:
News
mlwhiz.com
·
6d
6 days ago
Actions for The Transformer, Demystified — Let's Actually Build One
Markov Chains: The Grandparents of LLMs
📝
Natural Language Processing
dmanco.dev
·
1d
1 day ago
·
Hacker News
Actions for Markov Chains: The Grandparents of LLMs
Less-relevant results
Google open-sources speedy DiffusionGemma text diffusion
model
🤖
AI
siliconangle.com
·
22h
22 hours ago
Actions for Google open-sources speedy DiffusionGemma text diffusion model
Don't let the
LLM
speak, just probe it (8 minute read)
🤖
AI
Content type:
Blog
blog.j11y.io
·
23h
23 hours ago
Actions for Don't let the LLM speak, just probe it (8 minute read)
Visual Artist and Percussionist Bob
Bert
(Sonic Youth, Pussy Galore) Talks Experimenting With Sounds on Debut Solo Album ‘Beach Bongo Bloodbath’ (INTERVIEW)
🚀
MLOps
glidemagazine.com
·
2d
2 days ago
Actions for Visual Artist and Percussionist Bob Bert (Sonic Youth, Pussy Galore) Talks Experimenting With Sounds on Debut Solo Album ‘Beach Bongo Bloodbath’ (INTERVIEW)
Analyzing the geometric dependence of thermoelastic Q -factor in micro hemispherical resonators via a data-augmented
CNN-transformer
model
🤖
AI
Content type:
Academic
nature.com
·
6d
6 days ago
Actions for Analyzing the geometric dependence of thermoelastic Q -factor in micro hemispherical resonators via a data-augmented CNN-transformer model
Machine learning from scratch, what to build before using scikit-learn
🧠
Machine Learning
Content type:
Tutorial
iwtlp.com
·
1d
1 day ago
·
DEV
Actions for Machine learning from scratch, what to build before using scikit-learn
Generative AI in the Real World: Agentic Systems Fundamentals with Maarten Grootendorst
📝
Natural Language Processing
Content type:
Audio
oreilly.com
·
23h
23 hours ago
Actions for Generative AI in the Real World: Agentic Systems Fundamentals with Maarten Grootendorst
DiffusionGemma: Discrete diffusion in a large language
model
🤖
LLM
idlemachines.co.uk
·
1h
1 hour ago
·
Hacker News
Actions for DiffusionGemma: Discrete diffusion in a large language model
Mi50 32GB / GFX906 - vLLM Qwen 3.5 Configuration for Qwen 3.5:9B AWQ-4bit
🤖
AI
huggingface.co
·
1h
1 hour ago
·
r/LocalLLaMA
Actions for Mi50 32GB / GFX906 - vLLM Qwen 3.5 Configuration for Qwen 3.5:9B AWQ-4bit
Guardian Angels:
LLM
Personalization for Productivity and Security
⛓️
LangChain
gwern.net
·
4d
4 days ago
·
Hacker News
Actions for Guardian Angels: LLM Personalization for Productivity and Security
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help