⚙️ AI Infrastructure
AI stack, model serving, inference, ML infrastructure
Scoured 215 posts in 8.0 ms
OpenCV 5.0 Computer Vision Library Released with Rewritten DNN Engine
📊 AI Monitoring
linuxiac.com · 2 days ago
CommBench: Can LLMs Write Correct and Efficient GPU Communication Code?
🧠 LLMs
uccl-project.github.io · 6 hours ago · Hacker News
Latest technical articles & videos.
📊 AI Monitoring
certdepot.net · 5 days ago
High Bandwidth Flash | A New Memory for AI Data Centers and Edge Computing | Sandisk
🎯 AI Alignment
ncnonline.net · 2 days ago
DiffusionGemma: 4x Faster Text Generation
🔍 GEO
Content type: News, Blog
blog.google · 21 hours ago · Hacker News, r/LocalLLaMA, r/singularity
Why agentic AI needs an open inference stack
📊 AI Monitoring
redhat.com · 3 days ago
DiffusionGemma: The Developer Guide- Google Developers Blog
🔍 GEO
Content type: Blog
developers.googleblog.com · 1 day ago · r/LocalLLaMA
AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support
🧠 LLMs
phoronix.com · 20 hours ago · r/artificial
A system programmer’s guide to LLM inference
🧠 LLMs
Content type: Blog
blog.xiangpeng.systems · 3 days ago · Hacker News
Monitor Nebius AI Cloud with Datadog
📊 AI Monitoring
Content type: Blog
datadoghq.com · 2 days ago
Predicting the World Cup Winner: Live Coding with Hopswor...
🧑💻 Indie Hackers
hopsworks.ai · 19 hours ago · Hacker News
How we fight GPU scarcity without compromise
🔍 GEO
Content type: Blog
equixly.com · 6 days ago · Hacker News
sgl-project/sglang-omni: SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models
🧠 LLMs
Content type: Code
github.com · 1 day ago
What Network Data Can and Can’t Tell Us About AI Infrastructure
📊 AI Monitoring
Content type: Blog
backblaze.com · 16 hours ago
China's Xiaomi MiMo Is Now 15X Faster Than ChatGPT and Claude (4 minute read)
🧠 LLMs
Content type: News
decrypt.co · 2 days ago · Hacker News
Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency
🧠 LLMs
Content type: News, Blog
blog.google · 5 days ago · Hacker News
Apple WWDC On-Device AI Deep Dive - Google Docs
🛠️ Developer Tools
gist.is · 15 hours ago · Hacker News
Model Evaluations: Prove Your Routing Policy Actually Works
📊 AI Monitoring
Content type: Blog
digitalocean.com · 6 days ago
RKSC: Reasoning-Aware KV Cache Sharing and Confident Early Exit for Multi-Step LLM Inference
🧠 LLMs
Content type: Academic
arxiv.org · 1 day ago
For Robotaxis, Safety Must Be Built In, Not Bolted On
🧩 Epistemics
Content type: Blog
blogs.nvidia.com · 18 hours ago