Transformer Architecture
Specific
Attention, BERT, GPT, Sequence Models
Scoured 163 posts in 8.4 ms
markusheimerl/gpt: A generative pretrained transformer implementation
🔗 RAG · Content type: Code · github.com · 4 days ago · Hacker News
know the mother tongue of your LLMs
🤖 Local LLMs · mothertoken.inigoimaz.com · 1 day ago · Hacker News
PENet+: A Lightweight Residual Transformer Framework for Efficient Image Steganalysis
🔗 RAG · Content type: Academic · arxiv.org · 8 hours ago
How Confident Are AI Classifiers About Their Own Confidence?
💬 Natural Language Processing · Content type: Blog · gmcirco.github.io · 1 day ago · Hacker News
How LLMs Actually Work: A Friendly Map for Humans • oreoro
💬 Natural Language Processing · oreoro.github.io · 4 days ago · Hacker News
Visual Artist and Percussionist Bob Bert (Sonic Youth, Pussy Galore) Talks Experimenting With Sounds on Debut Solo Album ‘Beach Bongo Bloodbath’ (INTERVIEW)
🎺 Jazz · glidemagazine.com · 13 hours ago
The Memory Problem is Solved: How Google’s Memory Caching Makes RNNs Smart Again
🤖 Machine Learning · Content type: Blog · medium.com · 1 day ago
AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence
💬 Natural Language Processing · techradar.com · 5 days ago
Attention Based Interpretability With Concept Transformer
🔍 Vector Search · Content type: Blog · medium.com · 5 days ago
Why LLMs hallucinate?
🧠 LLM Reasoning · Content type: Blog · medium.com · 4 days ago
The Transformer, Demystified — Let's Actually Build One
📝 TextRank · Content type: News · mlwhiz.com · 4 days ago
Post-training is (Massive) Supervised Learning
🤖 Machine Learning · Content type: Academic · arxiv.org · 1 day ago
Issue #390 - The ML Engineer 🤖
💬 Natural Language Processing · Content type: News, Blog · machinelearning.substack.com · 3 days ago · Substack
Guardian Angels: LLM Personalization for Productivity and Security
🎭 Anthropic Claude · gwern.net · 3 days ago · Hacker News
MLPerf and the rise of latency-aware LLM benchmarking
💬 Natural Language Processing · edn.com · 5 days ago
What an LLM Actually Does With Your Prompt First
💬 Natural Language Processing · siliconopera.com · 5 days ago
Towards Tight Bounds for Streaming Attention
🧮 Algorithms · Content type: Academic · arxiv.org · 2 days ago
My research agenda and work
🧩 Cognitive Science · lesswrong.com · 4 days ago
Analyzing the geometric dependence of thermoelastic Q-factor in micro hemispherical resonators via a data-augmented CNN-transformer model
🧠 Deep Learning · Content type: Academic · nature.com · 5 days ago
Uncertainty-Aware LLM-Guided Policy Shaping for Sparse-Reward Reinforcement Learning
🧠 LLM Reasoning · Content type: Academic · arxiv.org · 2 days ago