Scour
⚡ Transformers
Specific
Attention Mechanism, BERT, GPT, Sequence Models
Scoured 205 posts in 6.3 ms
harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
🤖 AI · Content type: Code · github.com · 4 days ago · Hacker News
Audio-Visual Exchange-Aware Token Pruning for Efficient Audio-Visual Captioning
👁️ Computer Vision · Content type: Academic · arxiv.org · 21 hours ago
Contribution Weights: A Geometrical Analysis of Self-Attention Transformers
💬 LLMs · Content type: Academic · arxiv.org · 1 day ago
Operator Fusion for LLM Inference on the Tensix Architecture
🤖 Machine Learning · Content type: Academic · arxiv.org · 21 hours ago
Post-training is (Massive) Supervised Learning
🎛️ Fine-tuning · Content type: Academic · arxiv.org · 1 day ago
mingusb/transformer-golf: The Fully Unrolled Transformer: An experimental repository for architecture simplification and compilation. [2026]
🧠 Neural Networks · Content type: Code · github.com · 6 days ago · Hacker News
DUET -- Dual User Embedding Transformers for Offsite Conversion Prediction
🤖 Machine Learning · Content type: Academic · arxiv.org · 21 hours ago
Early Comparative Evaluation of Transformer Models for Multilingual Software Vulnerability Detection
💬 LLMs · Content type: Academic · arxiv.org · 21 hours ago
Uncertainty-Aware LLM-Guided Policy Shaping for Sparse-Reward Reinforcement Learning
🎮 Reinforcement Learning · Content type: Academic · arxiv.org · 2 days ago
mikinko/HuggingFace_WFX: Total Commander WFX plugin for HuggingFace repos
🤖 AI · Content type: Code · github.com · 4 days ago · r/StableDiffusion
Beyond Patches: Superpixel Token-based Transformers for Attribute-Specific Fashion Retrieval
🤖 AI · Content type: Academic · arxiv.org · 21 hours ago
Chiaroscuro Attention: Spending Compute in the Dark
⚡ Flash Attention · Content type: Academic · arxiv.org · 1 day ago
Transformer Based Model for Spatiotemporal Feature Learning in EEG Emotion Recognition
🧮 Complexity Theory · Content type: Academic · arxiv.org · 21 hours ago
tenurehq/precisionMemBench: Precision-aware retrieval benchmark for LLM memory systems.
🤖 AI · Content type: Code · github.com · 6 days ago · Hacker News
Look Less, Reason More: Block-wise Attention Skipping for Efficient Multimodal LLMs
👁️ Computer Vision · Content type: Academic · arxiv.org · 1 day ago
InA-Probe: Instruction-Aware Active Probing for Time Series Forecasting with LLMs
📈 Time Series Analysis · Content type: Academic · arxiv.org · 1 day ago
FuseFSS: Efficient Secure LLM Inference with Function Secret Sharing
💬 LLMs · Content type: Academic · arxiv.org · 1 day ago
When Vision Misleads, Let Location Speak: A Worldwide Image Geo-Localization Method via Location Attention Mechanism and Large Multimodal Models
🤖 AI · Content type: Academic · arxiv.org · 1 day ago
Inside the LLM Word Factory
💬 Natural Language Processing · Content type: Academic · arxiv.org · 1 day ago
TextEconomizer: Enhancing Lossy Text Compression with Denoising Transformers and Entropy Coding
🤖 AI · Content type: Academic · arxiv.org · 1 day ago