Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Infrastructure
🏗️ AI Infrastructure
AI hardware, ML infrastructure, AI accelerator, inference server
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
370
posts in
7.1
ms
Data center
infrastructure
startup TensorWave raises $350M to help break Nvidia’s
AI
chip monopoly
⚡
GPU Computing
siliconangle.com
·
4h
4 hours ago
Actions for Data center infrastructure startup TensorWave raises $350M to help break Nvidia’s AI chip monopoly
NexusOS v2.0 – A zero-dependency pipeline streaming
server
chaos to Parquet
🔧
MLOps
huggingface.co
·
2d
2 days ago
·
Hacker News
Actions for NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet
DiffusionGemma: 4x Faster Text Generation
💬
LLMs
Content type:
News
Content type:
Blog
blog.google
·
14h
14 hours ago
·
Hacker News
,
r/LocalLLaMA
,
r/singularity
Actions for DiffusionGemma: 4x Faster Text Generation
2x GH200 for LLM
inference
, Part 2:
vLLM
, DeepSeek V4 Flash, and MTP
💬
LLMs
Content type:
Blog
dnhkng.github.io
·
3d
3 days ago
Actions for 2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP
GPUsnek is Python on nVidia’s
CUDA
⚡
GPU Computing
Content type:
Blog
blog.adafruit.com
·
10h
10 hours ago
Actions for GPUsnek is Python on nVidia’s CUDA
Report: GKE
Inference
Gateway delivers up to 92% faster
AI
responses
💬
LLMs
Content type:
Blog
cloud.google.com
·
2d
2 days ago
·
Hacker News
Actions for Report: GKE Inference Gateway delivers up to 92% faster AI responses
Gemma 4 QAT on 10GB Laptop: Local
AI
with 6.7GB VRAM
💾
AI Chips
everylocalai.com
·
10h
10 hours ago
·
DEV
Actions for Gemma 4 QAT on 10GB Laptop: Local AI with 6.7GB VRAM
Using Scikit-LLM with Open-Source LLMs
💬
LLMs
machinelearningmastery.com
·
6d
6 days ago
Actions for Using Scikit-LLM with Open-Source LLMs
Monitor Nebius
AI
Cloud with Datadog
⚡
GPU Computing
Content type:
Blog
datadoghq.com
·
2d
2 days ago
Actions for Monitor Nebius AI Cloud with Datadog
Google's new open
model
DiffusionGemma generates text from noise instead of word by word
🟢
NVIDIA
the-decoder.com
·
11h
11 hours ago
Actions for Google's new open model DiffusionGemma generates text from noise instead of word by word
DiffusionGemma: The Developer Guide
💬
LLMs
Content type:
Blog
developers.googleblog.com
·
1d
1 day ago
Actions for DiffusionGemma: The Developer Guide
How we fight
GPU
scarcity without compromise
🤖
Machine Learning
Content type:
Blog
equixly.com
·
5d
5 days ago
·
Hacker News
Actions for How we fight GPU scarcity without compromise
Neo-X7/Neo-AI
: A fully offline
AI
assistant powered by
Ollama
. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.
💾
AI Chips
Content type:
Code
github.com
·
18h
18 hours ago
·
DEV
Actions for Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.
Tales of an
Ollama
Honeypot (Part 3): More Traffic, More Findings
🌐
Networking
posts.inthecyber.com
·
2d
2 days ago
Actions for Tales of an Ollama Honeypot (Part 3): More Traffic, More Findings
WSL 3 will finally let Linux apps use your
GPU
and
NPU
without the performance tax
⚡
GPU Computing
xda-developers.com
·
14h
14 hours ago
Actions for WSL 3 will finally let Linux apps use your GPU and NPU without the performance tax
Breaking the Ice: Analyzing Cold Start Latency in
vLLM
💬
LLMs
Content type:
Academic
arxiv.org
·
3d
3 days ago
·
Hacker News
Actions for Breaking the Ice: Analyzing Cold Start Latency in vLLM
google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation
🔧
MLOps
huggingface.co
·
2d
2 days ago
·
r/LocalLLaMA
Actions for google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation
Intel XPU Manager 2.0 Overhauls Windows & Linux Management For Arc Pro GPUs
💾
AI Chips
phoronix.com
·
11h
11 hours ago
Actions for Intel XPU Manager 2.0 Overhauls Windows & Linux Management For Arc Pro GPUs
Vortex expands open RISC-V graphics
⚡
GPU Computing
jonpeddie.com
·
9h
9 hours ago
Actions for Vortex expands open RISC-V graphics
Breaking free of a single datacenter: Practical
geo-distributed
AI
operations with the k0smos platforms
🏢
Data Centers
Content type:
Blog
cncf.io
·
2d
2 days ago
Actions for Breaking free of a single datacenter: Practical geo-distributed AI operations with the k0smos platforms
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help