Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🧠 LLM Inference
Quantization, Attention Mechanisms, Batch Processing, KV Caching
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
4076
posts in
51.2
ms
Databricks adds
MemAlign
to
MLflow
to cut cost and latency of LLM evaluation
infoworld.com
·
6d
🧠
Local llm
Running LLMs in-browser via
WebGPU
, Transformers.js, and Chrome's Prompt API—no
Ollama
, no server
noaibills.app
·
4d
·
Discuss:
r/LocalLLaMA
,
r/SideProject
,
r/selfhosted
🧠
Local llm
do you know more modern version of something like
byt5-small
?
huggingface.co
·
3d
·
Discuss:
r/LocalLLaMA
🤖
Machine Learning
The 8GB VRAM Image Model That Feels Instant: Meet FLUX.2
Klein
4B
hackernoon.com
·
3d
✨
Gleam
The
UI
: Why It's the Real AI Agent
Bottleneck
hackernoon.com
·
2d
📊
Prometheus
Making a Hardware Accelerated Live TV Player from Scratch in C: HLS Streaming,
MPEG-TS
Demuxing
, H.264 Parsing, and Vulkan Video Decoding
blog.jaysmito.dev
·
2d
·
Discuss:
Hacker News
,
r/programming
📊
Prometheus
AI
workloads
challenge the
cattle
model
varoa.net
·
4d
·
Discuss:
Hacker News
☸️
Kubernetes
Achieving
Ultra-Fast AI Chat
Widgets
cjroth.com
·
3d
·
Discuss:
Hacker News
📊
Prometheus
Issue 637
datascienceweekly.substack.com
·
5d
·
Discuss:
Substack
🤖
Machine Learning
Show HN:
Routed
Attention – 75-99% savings by routing between O(N) and O(
N²
)
zenodo.org
·
4d
·
Discuss:
Hacker News
👁️
Observability
Seedance
2.0 preview: The best video model of 2026,
outperforming
Sora 2
news.ycombinator.com
·
2d
·
Discuss:
Hacker News
✨
Gleam
Hallucinations
in
GPT5
– Can models say "I don't know" (June 2025)
jobswithgpt.com
·
4d
·
Discuss:
Hacker News
📊
Prometheus
Shared
LoRA
Subspaces
for almost Strict Continual Learning
arxiv.org
·
5d
·
Discuss:
Hacker News
🤖
Machine Learning
Generative
Modeling
via
Drifting
lambertae.github.io
·
6d
·
Discuss:
Hacker News
🤖
Machine Learning
Beyond
agentic
coding
haskellforall.com
·
4d
·
Discuss:
Lobsters
,
Hacker News
,
Hacker News
,
r/programming
📊
Prometheus
For real
game-theoretic
reasoning, we need best response in
imperfect
information games
weyxie.bearblog.dev
·
2d
·
Discuss:
Hacker News
👁️
Observability
Heterogeneous
Processing: A Strategy for
Augmenting
Moore's Law (2006)
linuxjournal.com
·
3d
·
Discuss:
Hacker News
👁️
Observability
Writing an LLM from scratch, part
32b
-- Interventions: gradient
clipping
gilesthomas.com
·
6d
·
Discuss:
Hacker News
🤖
Machine Learning
We
recreated
the Anthropic C
compiler
agent
vizops.ai
·
2d
·
Discuss:
Hacker News
🕸️
WebAssembly
EU AI Act
Compliance
for
Enterprise
AI Systems: What Your Engineering Team Needs to Build
medium.com
·
2d
·
Discuss:
Hacker News
📊
Prometheus
Loading...
Loading more...
« Page 7
•
Page 9 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help