Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Infrastructure
🏗️ AI Infrastructure
AI hardware, ML infrastructure, AI accelerator, inference server
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
375
posts in
6.4
ms
Vortex expands open RISC-V graphics
⚡
GPU Computing
jonpeddie.com
·
11h
11 hours ago
Actions for Vortex expands open RISC-V graphics
"
AI
" Is Eating Platform Monopolist Free Cash Flow, Not the World: CHART OF THE DAY
🤖
Machine Learning
Content type:
News
Content type:
Blog
braddelong.substack.com
·
2d
2 days ago
·
Substack
Actions for "AI" Is Eating Platform Monopolist Free Cash Flow, Not the World: CHART OF THE DAY
The Forbes 30 Under 30 CEO who left Lockheed Martin's Skunk Works raises $350M at $1.55B to challenge Nvidia's grip on
AI
infrastructure
— TFN
🟢
NVIDIA
techfundingnews.com
·
18h
18 hours ago
Actions for The Forbes 30 Under 30 CEO who left Lockheed Martin's Skunk Works raises $350M at $1.55B to challenge Nvidia's grip on AI infrastructure — TFN
Modernizing
attendance ticketing in SAS Viya using SAS Agentic
AI
Accelerator
💬
LLMs
Content type:
Blog
blogs.sas.com
·
1d
1 day ago
Actions for Modernizing attendance ticketing in SAS Viya using SAS Agentic AI Accelerator
How we fight
GPU
scarcity without compromise
🤖
Machine Learning
Content type:
Blog
equixly.com
·
5d
5 days ago
·
Hacker News
Actions for How we fight GPU scarcity without compromise
Apple WWDC On-Device
AI
Deep Dive - Google Docs
🌐
Networking
gist.is
·
10h
10 hours ago
·
Hacker News
Actions for Apple WWDC On-Device AI Deep Dive - Google Docs
Cohere open-sources a coding agent that runs on a single
H100
🟢
NVIDIA
venturebeat.com
·
1d
1 day ago
Actions for Cohere open-sources a coding agent that runs on a single H100
Build a Medical Report Analyzer on Dedicated
Inference
with Python
💬
LLMs
digitalocean.com
·
6d
6 days ago
Actions for Build a Medical Report Analyzer on Dedicated Inference with Python
DiffusionGemma is Google’s fastest
AI
yet, but it comes with a big trade-off
🟢
NVIDIA
androidauthority.com
·
1h
1 hour ago
Actions for DiffusionGemma is Google’s fastest AI yet, but it comes with a big trade-off
On-device
AI
is a margin decision
💬
LLMs
Content type:
Blog
ziraph.com
·
14h
14 hours ago
·
Hacker News
Actions for On-device AI is a margin decision
Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good
🏢
Data Centers
Content type:
Blog
towardsai.net
·
2d
2 days ago
Actions for Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good
Google's latest DiffusionGemma open
AI
model
comes with a 4x speed boost
🟢
NVIDIA
Content type:
News
arstechnica.com
·
12h
12 hours ago
Actions for Google's latest DiffusionGemma open AI model comes with a 4x speed boost
I Processed 2.4 Billion Tokens Across 52
AI
Models
for $0.52. Here's the Full Breakdown.
💬
LLMs
saintlex.sbs
·
4h
4 hours ago
·
DEV
Actions for I Processed 2.4 Billion Tokens Across 52 AI Models for $0.52. Here's the Full Breakdown.
Google Colab CLI opens runtimes to Claude Code and Codex
💬
LLMs
helpnetsecurity.com
·
3d
3 days ago
·
r/ClaudeAI
Actions for Google Colab CLI opens runtimes to Claude Code and Codex
MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is
Hardware-Dependent
💬
LLMs
Content type:
Blog
bric.pe.kr
·
2d
2 days ago
·
DEV
Actions for MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent
AI
Serving Platform That Adapts to Your
Model
💬
LLMs
Content type:
Blog
databricks.com
·
16h
16 hours ago
Actions for AI Serving Platform That Adapts to Your Model
PCIe Benefits From
AI
, Despite Scaling Protocols
🌐
Networking
semiengineering.com
·
1d
1 day ago
Actions for PCIe Benefits From AI, Despite Scaling Protocols
Neo-X7/Neo-AI
: A fully offline
AI
assistant powered by
Ollama
. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.
💾
AI Chips
Content type:
Code
github.com
·
19h
19 hours ago
·
DEV
Actions for Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.
Best Stateful Sandboxes for Code Execution in 2026
☁️
Cloud Computing
Content type:
Blog
beam.cloud
·
5d
5 days ago
Actions for Best Stateful Sandboxes for Code Execution in 2026
PagedAttention vs Traditional KV Cache: How
vLLM
Reinvented
GPU
Memory for LLM
Inference
💬
LLMs
Content type:
Blog
medium.com
·
2d
2 days ago
Actions for PagedAttention vs Traditional KV Cache: How vLLM Reinvented GPU Memory for LLM Inference
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help