Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
⚙️ MLOps
Specific
model deployment, ML pipelines, inference, model serving
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
7148
posts in
18.1
ms
MCAP
: Deployment-Time Layer
Profiling
for Memory-Constrained LLM Inference
📱
Edge AI Optimization
arxiv.org
·
6d
The Data
Layer
Tax for Robot Learning
🧠
Machine Learning
rerun.io
·
2h
·
Hacker News
GoogleCloudPlatform/activation-model-scanner
:
Verify
language model safety before deployment by analyzing activation patterns
💉
Prompt Injection
github.com
·
11h
·
Hacker News
How we built the most performant DeepSeek V3.2, MiniMax-M2.5 and Qwen 3.5
397B
on DigitalOcean NVIDIA
HGX
™ B300 GPU Droplets
📱
Edge AI Optimization
digitalocean.com
·
2d
Three
Cobblers
, One
Zhuge
Liang: Making Cheaper Models Work Together
🪄
Prompt Engineering
markhuang.ai
·
15h
·
Hacker News
vLLM-Lens
: Fast Interpretability
Tooling
That Scales to Trillion-Parameter Models
📱
Edge AI Optimization
lesswrong.com
·
6d
What agentic AI
borrowed
from
microservices
(and made worse)
🔧
Agent Tooling
temporal.io
·
20h
·
Hacker News
umbecanessa/neural-ledger-system
: An inference architecture that makes LLMs
stateful
. Patent pending (US 64/050,345).
🪄
Prompt Engineering
github.com
·
1d
·
Hacker News
AutoPyVerifier
: Learning Compact Executable
Verifiers
for Large Language Model Outputs
✅
Formal Verification
arxiv.org
·
2d
Lessons from Building an
OTel
Normalizer
for GenAI (Part 1)
🪝
eBPF
groundcover.com
·
10h
·
Hacker News
Monitoring LLM behavior: Drift,
retries
, and
refusal
patterns
🛡️
AI Safety
venturebeat.com
·
5d
·
Hacker News
Fixing
What LLMs Get Wrong (22 minute read)
🪄
Prompt Engineering
thebigdataguy.substack.com
·
3d
·
Substack
Programming with Data: Test-Driven Data Engineering for Self-Improving LLMs from
Raw
Corpora
✨
LLMs
arxiv.org
·
1d
DeepSeek-V4 on Day 0: From Fast Inference to Verified
RL
with
SGLang
and Miles
📱
Edge AI Optimization
lmsys.org
·
4d
·
Hacker News
Adaptive and Fine-grained Module-wise Expert Pruning for Efficient
LoRA-MoE
Fine-Tuning
🤖
LLM
arxiv.org
·
11h
Building
Semantic
Version Control in Rust
⚙️
Compilers
therohansharma.com
·
4d
·
Hacker News
A Survey on Split Learning for LLM
Fine-Tuning
: Models, Systems, and Privacy
Optimizations
✨
LLMs
arxiv.org
·
2d
Show HN:
Récif
– Open-source control
tower
for AI agents on Kubernetes
🔧
Agent Tooling
recif-platform.github.io
·
6d
·
Hacker News
Rcarmo/gte-go
: Golang inference for the
GTE
Small embedding model
🤖
LLM
github.com
·
5d
·
Hacker News
AI
Observability
for Large Language Model Systems: A Multi-Layer Analysis of Monitoring Approaches from Confidence
Calibration
to Infrastructure Tracing
🛡️
AI Safety
arxiv.org
·
11h
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help