Skip to main content
Scour
Discover
Docs
Login
Sign Up
Discover
About
Docs
Changelog
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLM Inference
๐ง LLM Inference
Specific
Quantization, Attention Mechanisms, Batch Processing, KV Caching
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
293
posts in
26.9
ms
๐๏ธ
LLM Infrastructure
GitHub
ยท
3d
3 days ago
Profile(v2.1.4) physics-aware optimizer for
vLLM
(31โ470 tok/s on A100)
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Profile(v2.1.4) physics-aware optimizer for vLLM (31โ470 tok/s on A100)
๐
LLM Benchmarking
arxiv.org
ยท
5d
5 days ago
Towards Distributed
Inference
of LLMs on a P2P Network
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Towards Distributed Inference of LLMs on a P2P Network
๐ค
AI
GitHub
ยท
5d
5 days ago
Native
Inference
Engine for macOS 14 or newer
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Native Inference Engine for macOS 14 or newer
๐๏ธ
LLM Infrastructure
arxiv.org
ยท
6d
6 days ago
KVEraser: Learning to Steer
KV
Cache
for Efficient Localized Context Erasing
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for KVEraser: Learning to Steer KV Cache for Efficient Localized Context Erasing
๐ป
Claude Code
GitHub
ยท
6d
6 days ago
fix(status): ignore stale context after
model
switch (#93306)
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for fix(status): ignore stale context after model switch (#93306)
๐พ
Prompt Caching
arxiv.org
ยท
5d
5 days ago
Models
Take Notes at Prefill:
KV
Cache
Can Be Editable and Composable
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Models Take Notes at Prefill: KV Cache Can Be Editable and Composable
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report