Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🧠 LLM Inference
Specific
Quantization, Attention Mechanisms, Batch Processing, KV Caching
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
186313
posts in
63.4
ms
One
Refiner
to Unlock Them All: Inference-Time Reasoning
Elicitation
via Reinforcement Query Refinement
🏗️
LLM Infrastructure
arxiv.org
·
2d
Scaling Multi-Node
Mixture-of-Experts
Inference Using Expert
Activation
Patterns
🧩
MoE
arxiv.org
·
3d
Nautile-370M
: Spectral Memory Meets Attention in a Small Reasoning Model
🪄
Prompt Engineering
arxiv.org
·
2d
When Hidden States Drift: Can
KV
Caches
Rescue Long-Range Speculative Decoding?
🔄
Cache Coherence
arxiv.org
·
1d
Large Language Models
Decide
Early and
Explain
Later
🏗️
LLM Infrastructure
arxiv.org
·
4d
Hardware Generation and Exploration of
Lookup
Table-Based
Accelerators
for 1.58-bit LLM Inference
⚡
Hardware Acceleration
arxiv.org
·
2d
HGQ-LUT
: Fast
LUT-Aware
Training and Efficient Architectures for DNN Inference
🔢
BitNet Inference
arxiv.org
·
4d
The Surprising
Universality
of LLM Outputs: A Real-Time Verification
Primitive
🏗️
LLM Infrastructure
arxiv.org
·
2d
CoQuant
: Joint Weight-Activation
Subspace
Projection for Mixed-Precision LLMs
🎯
Vector Quantization
arxiv.org
·
1d
Coverage-Based Calibration for Post-Training Quantization via
Weighted
Set Cover over
Outlier
Channels
🎯
Vector Quantization
arxiv.org
·
3d
Thinking Without Words: Efficient
Latent
Reasoning with
Abstract
Chain-of-Thought
🧠
Agent Memory
arxiv.org
·
4d
SpikingBrain2.0
: Brain-Inspired Foundation Models for Efficient Long-Context and Cross-Platform
Inference
🔢
BitNet Inference
arxiv.org
·
4d
« Page 1
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help