Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🧠 LLM Inference
Quantization, Attention Mechanisms, Batch Processing, KV Caching
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
4065
posts in
113.3
ms
EyesOff
: Why Some Models
Quantize
Better Than Others
ym2132.github.io
·
10h
·
Discuss:
Hacker News
🤖
Machine Learning
Guney-olu/nanoslg
: A from-scratch implementation of distributed LLM inference in simple readable Python
github.com
·
2d
·
Discuss:
Hacker News
,
r/LLM
🐢
Turso
Beyond
Kuramoto
Models: Associative Memory and Plastic
Synapses
in ML Ensembles
hackernoon.com
·
17h
🤖
Machine Learning
Architectural and Mathematical
Foundations
of Machine Learning: A
Rigorous
Synthesis of Theory, Geometry, and Implementation
chizkidd.github.io
·
19h
·
Discuss:
Hacker News
🤖
Machine Learning
First look: Run LLMs
locally
with
LM
Studio
infoworld.com
·
23h
🧠
Local llm
GLM
5 is already on
huggingface
!
huggingface.co
·
15h
·
Discuss:
r/LocalLLaMA
🧠
Local llm
Biases
in the Blind Spot: Detecting What LLMs Fail to
Mention
arxiv.org
·
1d
·
Discuss:
Hacker News
🧠
Local llm
Statistical Models for the Latent Space: From Gaussian
VAE
to
Kuramoto-Enhanced
S-VAE
hackernoon.com
·
2d
🤖
Machine Learning
Overview of end-to-end
encrypted
AI inference for
Confer
news.ycombinator.com
·
14h
·
Discuss:
Hacker News
🤖
Machine Learning
AI-augmented
data quality engineering
infoworld.com
·
2d
🤖
Machine Learning
Digitizing
the "
Shokunin
": How we encoded a Master's hammer strike into AI
yusukekaizen.substack.com
·
2h
·
Discuss:
Substack
🤖
Machine Learning
Transformer-Based Memory Forecasting: Leveraging
Anonymized
Aggregates
for Personal Insights
novice.media
·
11h
·
Discuss:
Hacker News
📊
Prometheus
Show HN: Fighting the War Against
Expensive
Reinforcement
Learning
cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app
·
1h
·
Discuss:
Hacker News
🤖
Machine Learning
Show HN: Latent-k –
Persistent
dependency
map to reduce AI coding token usage
latentk.org
·
19h
·
Discuss:
Hacker News
🧠
Local llm
Memsearch
,an agent memory with md as source of truth(inspired by
OpenClaw
)
zilliztech.github.io
·
6h
·
Discuss:
Hacker News
📊
Prometheus
How We Built the Fastest
Kimi
K2.5
on Artificial Analysis
baseten.co
·
17h
·
Discuss:
Hacker News
🤖
Machine Learning
Automating Inference Optimizations with NVIDIA
TensorRT
LLM
AutoDeploy
developer.nvidia.com
·
2d
·
Discuss:
Hacker News
🤖
Machine Learning
Dear
Agent:
Prove
it.
rijnard.com
·
4h
·
Discuss:
Hacker News
📊
Prometheus
Ask HN: Where does this
adversarial
prize
mechanism
break?
news.ycombinator.com
·
7h
·
Discuss:
Hacker News
🧠
Local llm
A Note on
Flat
Abstract
Syntax
Trees
gist.github.com
·
2d
·
Discuss:
Hacker News
🕸️
WebAssembly
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help