Scour
🤖 LLM Inference
Model Serving, Quantization, vLLM, ONNX Runtime
Scoured 172174 posts in 10.2 ms
Accuracy vs. Speed in Local LLMs: Finding Your Sweet Spot
grigio.org · 3h · Discuss: Hacker News
🚀 Performance
Optimizing LLM Inference: Sparse Activation, MoE, and Gated-MLP Efficiency
hackernoon.com · 1d
🧠 LLMs
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference
arxiv.org · 2d · Discuss: Hacker News
📡 Edge AI
Min-p sampling for LLMs
thoughtworks.com · 14h
🧠 LLMs
ConstraintBench: Benchmarking LLM Constraint Reasoning on Direct Optimization
arxiv.org · 1d
🧠 LLMs
Large model inference container – latest capabilities and performance enhancements
aws.amazon.com · 1d
⚙️ MLOps
Probabilistic Graph Neural Inference for bio-inspired soft robotics maintenance with ethical auditability baked in
dev.to · 3h · Discuss: DEV
📡 Edge AI
Some notes on unreliability of LLM APIs
andrewpwheeler.com · 19h · Discuss: Hacker News
⚙️ MLOps
Unsloth Dynamic 2.0 GGUFs
unsloth.ai · 3h · Discuss: Hacker News
⚙️ MLOps
brendanhogan/base-model-agents
github.com · 3h
🤖 AI
dReLU Sparsification: Recovering LLM Performance with 150B Token Pretraining
hackernoon.com · 12h
🧠 LLMs
🚀 Stop Guessing Which LLM Runs on Your Machine
dev.to · 1h · Discuss: DEV
📊 Profiling Tools
Asura: Looped Language Models done better
neel04.github.io · 2d · Discuss: Hacker News
🧠 LLMs
The 4 LLM Evaluation Frameworks: How to Benchmark AI Like Google and OpenAI Do
pub.towardsai.net · 22h
🧠 LLMs
LLM-Based Evolution as a Universal Optimizer
imbue.com · 15h · Discuss: Hacker News
📊 Incremental Computation
Reinforcement Learning for LLMs
mesuvash.github.io · 2d · Discuss: Hacker News
🧠 LLMs
"LLMs Out of Context"
lucek.ai · 1d · Discuss: Hacker News
🧠 LLMs
Evaluating LLMs using semantic entropy
research.thoughtworks.com · 14h
🧠 LLMs
Generative AI - a case of mismatched expectations
thoughtworks.com · 14h
🧠 LLMs
Understanding Large Language Models (LLMs)
insightsonindia.com · 2d
🧠 LLMs