Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Engineering
🤖 AI Engineering
AI engineer, ML pipelines, model deployment, inference
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
193
posts in
8.0
ms
Architecturally Significant
MLOps
Guidelines for ML
Model
Integration and
Deployment
: a Gray Literature Review
⚙️
MLOps
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Architecturally Significant MLOps Guidelines for ML Model Integration and Deployment: a Gray Literature Review
Inferoa
AI
harness claimed 90% cache savings. We ran it and measured 97.8%
🧠
LLMs
zozo123.github.io
·
2h
2 hours ago
·
Hacker News
Actions for Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%
Nvidia DGX Spark GB10 –
AI
Models
and Guide with
vLLM
and Autonomous Script
🧠
LLMs
Content type:
Code
github.com
·
4d
4 days ago
·
Hacker News
Actions for Nvidia DGX Spark GB10 – AI Models and Guide with vLLM and Autonomous Script
15 years of Software Center – A Look in the Mirror and over the Front Windshield
⚙️
MLOps
Content type:
Blog
metrics.blogg.gu.se
·
5h
5 hours ago
Actions for 15 years of Software Center – A Look in the Mirror and over the Front Windshield
2x GH200 for LLM
inference
, Part 2:
vLLM
, DeepSeek V4 Flash, and MTP
🧠
LLM Inference
Content type:
Blog
dnhkng.github.io
·
2d
2 days ago
Actions for 2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP
SDLC vs. AIDLC: Why Data
Engineering
is Pushing the Boundaries of Software Development
⚙️
MLOps
Content type:
Blog
medium.com
·
5d
5 days ago
Actions for SDLC vs. AIDLC: Why Data Engineering is Pushing the Boundaries of Software Development
Distributed multi-agent systems with Aspire and Microsoft Agent Framework
⚙️
MLOps
Content type:
Blog
devblogs.microsoft.com
·
20h
20 hours ago
Actions for Distributed multi-agent systems with Aspire and Microsoft Agent Framework
DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
🤖
Machine Learning
Content type:
News
newsletter.semianalysis.com
·
1d
1 day ago
·
Hacker News
Actions for DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
Google releases Gemma 4 12B with encoder-free multimodal architecture
🧠
LLM Inference
4sysops.com
·
22h
22 hours ago
Actions for Google releases Gemma 4 12B with encoder-free multimodal architecture
Speculators v0.5.0: DFlash support and online training
🧠
LLMs
developers.redhat.com
·
6d
6 days ago
Actions for Speculators v0.5.0: DFlash support and online training
google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation
🧠
LLM Inference
huggingface.co
·
2d
2 days ago
·
r/LocalLLaMA
Actions for google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation
LLM
Inference
Engineering
Room — Part 3: The Orchestration Layer
🧠
LLMs
Content type:
Blog
vimal-dwarampudi.medium.com
·
6d
6 days ago
Actions for LLM Inference Engineering Room — Part 3: The Orchestration Layer
New comment by HorizonFlowLive in "Ask HN: Who wants to be hired? (June 2026)"
🧠
LLMs
Content type:
Discussion
news.ycombinator.com
·
1d
1 day ago
·
Hacker News
Actions for New comment by HorizonFlowLive in "Ask HN: Who wants to be hired? (June 2026)"
Infrastructure Options for Scalable
AI
Inference
🧠
LLM Inference
Content type:
Blog
mirantis.com
·
14h
14 hours ago
Actions for Infrastructure Options for Scalable AI Inference
fix(gateway): fail closed for unknown
model
auth · openclaw/openclaw@85343ea
🧠
LLMs
Content type:
Code
github.com
·
5d
5 days ago
Actions for fix(gateway): fail closed for unknown model auth · openclaw/openclaw@85343ea
Day 07 of
MLOps
: Hands-On Experiment Tracking for Machine Learning
Models
⚙️
MLOps
Content type:
Blog
medium.com
·
2d
2 days ago
Actions for Day 07 of MLOps: Hands-On Experiment Tracking for Machine Learning Models
Agent-as-a-Code in Databricks for Production
⚙️
MLOps
Content type:
Blog
medium.com
·
4d
4 days ago
Actions for Agent-as-a-Code in Databricks for Production
Scale Robot Reinforcement Learning with NVIDIA Isaac Lab on Amazon SageMaker
AI
⚙️
MLOps
Content type:
Blog
aws.amazon.com
·
16h
16 hours ago
Actions for Scale Robot Reinforcement Learning with NVIDIA Isaac Lab on Amazon SageMaker AI
Breaking the Ice: Analyzing Cold Start Latency in
vLLM
🧠
LLM Inference
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Breaking the Ice: Analyzing Cold Start Latency in vLLM
Token4Token — pay-per-token
inference
on Gnosis + Swarm
🧠
LLMs
t4t.eth.link
·
1d
1 day ago
·
Hacker News
Actions for Token4Token — pay-per-token inference on Gnosis + Swarm
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help