🤖 AI Engineering
AI engineer, ML pipelines, model deployment, inference
Scoured 194 posts in 6.1 ms
Architecturally Significant MLOps Guidelines for ML Model Integration and Deployment: a Gray Literature Review
⚙️ MLOps · Content type: Academic · arxiv.org · 2 days ago
Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%
🧠 LLMs · zozo123.github.io · 9 hours ago · Hacker News
Nvidia DGX Spark GB10 – AI Models and Guide with vLLM and Autonomous Script
🧠 LLMs · Content type: Code · github.com · 5 days ago · Hacker News
15 years of Software Center – A Look in the Mirror and over the Front Windshield
⚙️ MLOps · Content type: Blog · metrics.blogg.gu.se · 11 hours ago
2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP
🧠 LLM Inference · Content type: Blog · dnhkng.github.io · 2 days ago
SDLC vs. AIDLC: Why Data Engineering is Pushing the Boundaries of Software Development
⚙️ MLOps · Content type: Blog · medium.com · 5 days ago
DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
🤖 Machine Learning · Content type: News · newsletter.semianalysis.com · 1 day ago · Hacker News
AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support
🧠 LLMs · phoronix.com · 3 hours ago
Speculators v0.5.0: DFlash support and online training
🧠 LLMs · developers.redhat.com · 6 days ago
google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation
🧠 LLM Inference · huggingface.co · 2 days ago · r/LocalLLaMA
Predicting the World Cup Winner: Live Coding with Hopswor...
⚙️ MLOps · hopsworks.ai · 1 hour ago · Hacker News
NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI
🤖 ai · Content type: Blog · blogs.nvidia.com · 3 hours ago
LLM Inference Engineering Room — Part 3: The Orchestration Layer
🧠 LLMs · Content type: Blog · vimal-dwarampudi.medium.com · 6 days ago
Distributed multi-agent systems with Aspire and Microsoft Agent Framework
⚙️ MLOps · Content type: Blog · devblogs.microsoft.com · 1 day ago
Tejas-TA/predikit: The missing bridge between your ML models and your AI agents.
🤖 ai · Content type: Code · github.com · 40 minutes ago · Hacker News
Agent-as-a-Code in Databricks for Production
⚙️ MLOps · Content type: Blog · medium.com · 5 days ago
Google releases Gemma 4 12B with encoder-free multimodal architecture
🧠 LLM Inference · 4sysops.com · 1 day ago
DiffusionGemma: The Developer Guide
🤖 ai · Content type: Blog · developers.googleblog.com · 19 hours ago
Location: Lubbock, TX, USA Remote: Yes (Remote-friendly, US-based) Technologies:...
🤖 Machine Learning · Content type: Discussion · news.ycombinator.com · 4 hours ago · Hacker News
Infrastructure Options for Scalable AI Inference
🧠 LLM Inference · Content type: Blog · mirantis.com · 20 hours ago