Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Transformers
⚡ Transformers
Specific
Attention Mechanism, BERT, GPT, Sequence Models
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
206
posts in
8.0
ms
TextEconomizer: Enhancing Lossy Text Compression with Denoising
Transformers
and Entropy Coding
🤖
AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for TextEconomizer: Enhancing Lossy Text Compression with Denoising Transformers and Entropy Coding
Towards Tight Bounds for Streaming
Attention
🤖
AI
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Towards Tight Bounds for Streaming Attention
Beyond Item IDs: Scaling Short-Form-Video Recommendation via Semantic-Native Long
Sequence
Modeling
🧮
Complexity Theory
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Beyond Item IDs: Scaling Short-Form-Video Recommendation via Semantic-Native Long Sequence Modeling
Selective
Coupling of Decoupled Informative Regions: Masked
Attention
Alignment for Data-Free Quantization of Vision
Transformers
🤖
AI
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Selective Coupling of Decoupled Informative Regions: Masked Attention Alignment for Data-Free Quantization of Vision Transformers
Attention
at the Theoretical Minimum: A Mathematics of Arrays Framework for Memory-Optimal
Transformer
Kernels
🤖
AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Attention at the Theoretical Minimum: A Mathematics of Arrays Framework for Memory-Optimal Transformer Kernels
LazyAttention: Efficient Retrieval-Augmented Generation with Deferred
Positional
Encoding
🤖
AI
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for LazyAttention: Efficient Retrieval-Augmented Generation with Deferred Positional Encoding
Query-based Cross-Modal Projector Bolstering Mamba Multimodal
LLM
🤖
AI
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Query-based Cross-Modal Projector Bolstering Mamba Multimodal LLM
Signed Dual
Attention
: Capturing Signed Dependencies in Time Series Forecasting
🤖
AI
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Signed Dual Attention: Capturing Signed Dependencies in Time Series Forecasting
Transformer-Enhanced
Reinforcement Learning: Fundamentals and Applications in Communication
Networks
🤖
AI
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for Transformer-Enhanced Reinforcement Learning: Fundamentals and Applications in Communication Networks
ATT-CR
: Adaptive Triangular
Transformer
for Cloud Removal
🧮
Complexity Theory
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for ATT-CR: Adaptive Triangular Transformer for Cloud Removal
Depth-Attention
: Cross-Layer Value Mixing for
Language
Models
📈
Optimization
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Depth-Attention: Cross-Layer Value Mixing for Language Models
Imbuing Large
Language
Models
with Bidirectional Logic for Robust Chain Repair
🤖
AI
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Imbuing Large Language Models with Bidirectional Logic for Robust Chain Repair
An Empirical Audit of Input Encoders for Multi-Channel Signal
Transformers
⚡
Quantization
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for An Empirical Audit of Input Encoders for Multi-Channel Signal Transformers
GRAMformer: Any-Order Modality Interactions via Volumetric Multimodal
Cross-Attention
🤖
AI
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for GRAMformer: Any-Order Modality Interactions via Volumetric Multimodal Cross-Attention
Phase transitions for the noisy
transformer
model
in arbitrary dimension
🤖
Machine Learning
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Phase transitions for the noisy transformer model in arbitrary dimension
Do
Transformers
Need Three Projections? Systematic Study of QKV Variants
⚡
Quantization
Content type:
Academic
arxiv.org
·
6d
6 days ago
·
Hacker News
Actions for Do Transformers Need Three Projections? Systematic Study of QKV Variants
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help