Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Transformers
🤖 Transformers
Specific
Attention Mechanism, Self-Attention, BERT, Architecture
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
68
posts in
13.0
ms
History of WYSIWYG editors and CMS: a timeline (2022)
💾
Retro Computing
Content type:
Blog
tiny.cloud
·
11h
11 hours ago
·
Hacker News
Actions for History of WYSIWYG editors and CMS: a timeline (2022)
Transformer
Based
Model
for Spatiotemporal Feature Learning in EEG Emotion Recognition
📡
Signal Processing
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Transformer Based Model for Spatiotemporal Feature Learning in EEG Emotion Recognition
See, Act, Correct: three levers for working with a code agent
🎮
Reinforcement Learning
Content type:
Blog
blog.owulveryck.info
·
6d
6 days ago
·
Hacker News
,
Hacker News
Actions for See, Act, Correct: three levers for working with a code agent
Introducing the Third Generation of Apple’s Foundation
Models
🤖
AI
machinelearning.apple.com
·
3d
3 days ago
·
Hacker News
,
r/apple
Actions for Introducing the Third Generation of Apple’s Foundation Models
Hasse Diagrams for
Attention
: A Partial Order Framework for Designing
Transformer
Masks
🧠
LLM
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Hasse Diagrams for Attention: A Partial Order Framework for Designing Transformer Masks
Human-Like
Neural
Nets
by Catapulting
🧠
LLM
gwern.net
·
4d
4 days ago
·
Hacker News
Actions for Human-Like Neural Nets by Catapulting
Beyond Item IDs: Scaling Short-Form-Video Recommendation via Semantic-Native Long Sequence
Modeling
💬
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Beyond Item IDs: Scaling Short-Form-Video Recommendation via Semantic-Native Long Sequence Modeling
Transformer-Enhanced
Reinforcement Learning: Fundamentals and Applications in Communication
Networks
🤖
AI
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Transformer-Enhanced Reinforcement Learning: Fundamentals and Applications in Communication Networks
princezuda/-RequiemGPT-: Fully open source and open weights built and trained by fable five with one prompt. An experience in how AI actually works
🤖
AI
Content type:
Code
github.com
·
1d
1 day ago
·
Hacker News
Actions for princezuda/-RequiemGPT-: Fully open source and open weights built and trained by fable five with one prompt. An experience in how AI actually works
Attention
at the Theoretical Minimum: A Mathematics of Arrays Framework for Memory-Optimal
Transformer
Kernels
🧠
LLM
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Attention at the Theoretical Minimum: A Mathematics of Arrays Framework for Memory-Optimal Transformer Kernels
DeepSeek Made AI Cheap. Now It Needs Billions to Keep It Cheap.
🚀
Startups
Content type:
News
Content type:
Blog
chinacompany.substack.com
·
6d
6 days ago
·
Substack
Actions for DeepSeek Made AI Cheap. Now It Needs Billions to Keep It Cheap.
Best-Known Sorting
Networks
🗄️
Vector Databases
bertdobbelaere.github.io
·
6d
6 days ago
·
Hacker News
Actions for Best-Known Sorting Networks
From
Architecture
to Output: Structural Origins of Hallucination in
Large
Language
Models and the Amplifying Role of Data
📊
Statistics
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for From Architecture to Output: Structural Origins of Hallucination in Large Language Models and the Amplifying Role of Data
Overcoming
Decoder
Inconsistencies in Whisper for Dravidian and Low-Resource
Languages
🧠
LLM
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Overcoming Decoder Inconsistencies in Whisper for Dravidian and Low-Resource Languages
A Mean-Field Analysis of
Multi-Head
Self-Attention
under Cross-Entropy Training
📐
Optimization Theory
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for A Mean-Field Analysis of Multi-Head Self-Attention under Cross-Entropy Training
Introducing Granite Libraries and Project Granite Switch
🤖
AI
Content type:
Blog
research.ibm.com
·
6d
6 days ago
·
Hacker News
Actions for Introducing Granite Libraries and Project Granite Switch
Parallel Causal Associative Fields: Gated Sparse Memory for Long-Context
Language
Modeling
🎛️
Control Systems
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Parallel Causal Associative Fields: Gated Sparse Memory for Long-Context Language Modeling
Learning from flowsheets: A generative
transformer
model
for autocompletion of flowsheets
🛠️
Developer Tools
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Learning from flowsheets: A generative transformer model for autocompletion of flowsheets
DMT: Demographic Conditioning, Morphology-Enhanced
Transformer
for Cuffless Blood Pressure Estimation from PPG Signals
📶
Communications
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for DMT: Demographic Conditioning, Morphology-Enhanced Transformer for Cuffless Blood Pressure Estimation from PPG Signals
Towards Tight Bounds for Streaming
Attention
🤖
AI
Content type:
Academic
arxiv.org
·
3d
3 days ago
Actions for Towards Tight Bounds for Streaming Attention
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help