Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
⚙️ MLOps
model deployment, ML infrastructure, model serving, pipelines
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
183979
posts in
23.6
ms
MCAP
: Deployment-Time Layer
Profiling
for Memory-Constrained LLM Inference
🤖
LLMs
arxiv.org
·
6d
Build Strands Agents with
SageMaker
AI models and
MLflow
🛠️
AI Dev Tools
aws.amazon.com
·
2d
Three
Cobblers
, One
Zhuge
Liang: Making Cheaper Models Work Together
🛠️
AI Dev Tools
markhuang.ai
·
15h
·
Hacker News
AmSach/kvquant
: Drop-in KV cache compressor for local LLM inference - Run 70B models on 8GB RAM
🤖
LLMs
github.com
·
3h
·
DEV
Introducing
DigitalOcean
AI-Native Cloud for Production AI
Workloads
🏗️
Infrastructure
digitalocean.com
·
1d
What agentic AI
borrowed
from
microservices
(and made worse)
📐
System Design
temporal.io
·
20h
·
Hacker News
How Does CI/CD
Differ
for Machine Learning Pipelines (
MLOps
)?
🛠️
AI Dev Tools
semaphore.io
·
6d
Best
Practices
for inference on Edge AI
MCUs
📐
System Design
embedded.com
·
19h
Caltech
’s
PrismML
shrinks AI models to fit your phone without losing their mind
🤖
LLMs
startupfortune.com
·
1d
The Data
Layer
Tax for Robot Learning
🤖
LLMs
rerun.io
·
2h
·
Hacker News
IT
engineer
by day, AI solutions founder by night — I was
drowning
in AI news so I built something to fix it
🛠️
AI Dev Tools
agent-builder-daily.vercel.app
·
16h
·
r/SideProject
Ways in which
GenAI
has changed the way I
write
code so far
🛠️
AI Dev Tools
lengrand.fr
·
4h
not much
happened
today
🛠️
AI Dev Tools
news.smol.ai
·
2d
vLLM-Lens
: Fast Interpretability
Tooling
That Scales to Trillion-Parameter Models
🤖
LLMs
lesswrong.com
·
6d
[
AINews
] The Inference
Inflection
🛠️
AI Dev Tools
latent.space
·
13h
GPU Scheduling in
Kubernetes
: Why It Starts Before the
Scheduler
📐
System Design
rack2cloud.com
·
2h
·
DEV
Agentic Data Engineering with
Genie
Code and
Lakeflow
🛠️
AI Dev Tools
databricks.com
·
2d
Scaling Pain of Coding Agent Serving: Lessons from
Debugging
GLM-5
at Scale
🤖
LLMs
z.ai
·
13h
·
Lobsters
Geniatech
AIM-M-K and AIM-B2 integrate
Ara240
for local AI inference
🛠️
AI Dev Tools
lxer.com
·
5h
Monitoring LLM behavior: Drift,
retries
, and
refusal
patterns
🛠️
AI Dev Tools
venturebeat.com
·
5d
·
Hacker News
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help