Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Transformer Architecture
🤖 Transformer Architecture
Specific
Self-Attention, BERT, GPT, Multi-Head Attention
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
132
posts in
4.6
ms
markusheimerl/gpt
: A generative pretrained
transformer
implementation
🧠
Neural Network Architectures
Content type:
Code
github.com
·
5d
5 days ago
·
Hacker News
Actions for markusheimerl/gpt: A generative pretrained transformer implementation
Reachability and asymptotics of Gaussian
Transformer
dynamics
🧠
Deep Learning
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Reachability and asymptotics of Gaussian Transformer dynamics
Machine learning from scratch, what to build before using scikit-learn
🧠
Neural Network Architectures
Content type:
Tutorial
iwtlp.com
·
17h
17 hours ago
·
DEV
Actions for Machine learning from scratch, what to build before using scikit-learn
Your LLM Isn’t Reading Your Manners — It’s Counting Your Tokens
👁️
Attention Mechanisms
Content type:
Blog
medium.com
·
4h
4 hours ago
Actions for Your LLM Isn’t Reading Your Manners — It’s Counting Your Tokens
ELI5 is a terrible learning prompt, here's the structural reason it fails and a 4-level replacement that actually sticks
👁️
Attention Mechanisms
Content type:
Blog
Content type:
Tutorial
appliedaihub.org
·
1d
1 day ago
·
r/PromptEngineering
Actions for ELI5 is a terrible learning prompt, here's the structural reason it fails and a 4-level replacement that actually sticks
Analyzing the geometric dependence of thermoelastic Q -factor in micro hemispherical resonators via a data-augmented
CNN-transformer
model
🧠
Deep Learning
Content type:
Academic
nature.com
·
6d
6 days ago
Actions for Analyzing the geometric dependence of thermoelastic Q -factor in micro hemispherical resonators via a data-augmented CNN-transformer model
The Sequence Knowledge #874:
Transformers
or Not?
🧠
Deep Learning
substackcdn.com
·
1d
1 day ago
·
Substack
Actions for The Sequence Knowledge #874: Transformers or Not?
Less-relevant results
Multimodal
Browser AI with
Transformers.js
for Images and Speech
🧠
Deep Learning
machinelearningmastery.com
·
19h
19 hours ago
Actions for Multimodal Browser AI with Transformers.js for Images and Speech
know the mother tongue of your LLMs
🔮
ML
mothertoken.inigoimaz.com
·
1d
1 day ago
·
Hacker News
Actions for know the mother tongue of your LLMs
How LLMs Actually Work: A Friendly Map for Humans • oreoro
👁️
Attention Mechanisms
oreoro.github.io
·
5d
5 days ago
·
Hacker News
Actions for How LLMs Actually Work: A Friendly Map for Humans • oreoro
Visual Artist and Percussionist Bob
Bert
(Sonic Youth, Pussy Galore) Talks Experimenting With Sounds on Debut Solo Album ‘Beach Bongo Bloodbath’ (INTERVIEW)
🚀
Model Deployment
glidemagazine.com
·
1d
1 day ago
Actions for Visual Artist and Percussionist Bob Bert (Sonic Youth, Pussy Galore) Talks Experimenting With Sounds on Debut Solo Album ‘Beach Bongo Bloodbath’ (INTERVIEW)
AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence
🧠
Neural Network Architectures
techradar.com
·
6d
6 days ago
Actions for AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence
The Memory Problem is Solved: How Google’s Memory Caching Makes RNNs Smart Again
🧠
Neural Network Architectures
Content type:
Blog
medium.com
·
2d
2 days ago
Actions for The Memory Problem is Solved: How Google’s Memory Caching Makes RNNs Smart Again
Adventurer becomes first British woman to cross Atlantic by hydrogen balloon
🔮
ML
Content type:
News
the-independent.com
·
3d
3 days ago
Actions for Adventurer becomes first British woman to cross Atlantic by hydrogen balloon
Pathetic pretense
🎲
Synthetic Data Generation
Content type:
Blog
freethoughtblogs.com
·
1d
1 day ago
Actions for Pathetic pretense
Researchers say they trained a foundation
model
from scratch for about $1,500
📈
Time Series Forecasting
venturebeat.com
·
9h
9 hours ago
Actions for Researchers say they trained a foundation model from scratch for about $1,500
Context windows in AI: why every token is a budget decision
🧠
Deep Learning
Content type:
Blog
redis.io
·
12h
12 hours ago
Actions for Context windows in AI: why every token is a budget decision
What shapes your power bill? Explainable AI outlines forecasts behind grid and price decisions
📈
Time Series Forecasting
techxplore.com
·
2d
2 days ago
Actions for What shapes your power bill? Explainable AI outlines forecasts behind grid and price decisions
We Taught a
Model
to Speak Legalese. Here’s What Changed.
🔮
ML
Content type:
Blog
medium.com
·
5d
5 days ago
Actions for We Taught a Model to Speak Legalese. Here’s What Changed.
What the ocean taught me about AI.
🧠
Neural Network Architectures
Content type:
Blog
medium.com
·
2d
2 days ago
Actions for What the ocean taught me about AI.
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help