Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🧠 LLM Inference
Quantization, Attention Mechanisms, Batch Processing, KV Caching
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
2743
posts in
91.0
ms
Understanding LLM Inference
Engines
: Inside
Nano-vLLM
(Part 2)
neutree.ai
·
3d
·
Discuss:
Hacker News
🤖
Machine Learning
Quantization-Aware
Distillation
ternarysearch.blogspot.com
·
1d
·
Discuss:
Hacker News
🤖
Machine Learning
AI-augmented
data quality engineering
infoworld.com
·
9h
🤖
Machine Learning
Tutorial – What is a
variational
autoencoder
?
jaan.io
·
2h
·
Discuss:
Hacker News
🤖
Machine Learning
Recursive
Deductive
Verification: A framework for reducing AI
hallucinations
news.ycombinator.com
·
1d
·
Discuss:
Hacker News
📊
Prometheus
A Note on
Flat
Abstract
Syntax
Trees
gist.github.com
·
59m
·
Discuss:
Hacker News
🕸️
WebAssembly
Lekh
AI v2.0 is out – Big offline AI update, Better memory and llama
GGUF
models support. Mac app coming next week.
apps.apple.com
·
18h
·
Discuss:
r/LocalLLaMA
🏠
Self-Hosting
LLMs Are Prediction
Machines
kaelandt.github.io
·
1d
·
Discuss:
Hacker News
🤖
Machine Learning
AI attention
span
so good it
shouldn
’t be legal
stackoverflow.blog
·
3d
🤖
Machine Learning
Sequential Attention: Making AI models
leaner
and faster without
sacrificing
accuracy
research.google
·
5d
·
Discuss:
Hacker News
,
r/LocalLLaMA
🤖
Machine Learning
Towards
Understanding What State Space Models Learn About Code
arxiv.org
·
14h
·
Discuss:
Hacker News
👁️
Observability
ML-LIB
: Machine Learning Library Proposed For The Linux Kernel
phoronix.com
·
2d
·
Discuss:
Hacker News
🤖
Machine Learning
Continual
learning and the post
monolith
AI era
baseten.co
·
2d
·
Discuss:
Hacker News
🤖
Machine Learning
Designing
a Cost-Efficient
Agentic
System
p.agnihotry.com
·
1h
·
Discuss:
Hacker News
📊
Prometheus
Testing 80 LLMs on
spatial
reasoning on
grids
mihai.page
·
19h
·
Discuss:
Hacker News
📊
Prometheus
An
attempt
at a
First-Proof
AI challenge
abhvio.us
·
1d
·
Discuss:
Hacker News
👁️
Observability
Circumstantial
Complexity
, LLMs and Large Scale Architecture
datagubbe.se
·
1h
·
Discuss:
Hacker News
🕸️
WebAssembly
Human-like Search for Modern
Applications
anvitra.ai
·
1d
·
Discuss:
Hacker News
🤖
Machine Learning
Show HN: Model Training Memory
Simulator
czheo.github.io
·
1d
·
Discuss:
Hacker News
🤖
Machine Learning
The Rise of Local Speech
Recognition
oatmealapp.com
·
1d
·
Discuss:
Hacker News
🤖
Machine Learning
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help