Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🔲 ML Hardware
GPU, TPU, inference hardware, AI accelerators, CUDA
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
189
posts in
10.4
ms
Characterization of machine learning compilers for LLM
inference
on
NVIDIA
GPUs
🧠
LLMs
link.springer.com
·
4d
·
Hacker News
The
Model
Parking Tax: Quantifying the Hidden Energy Cost of Always-On
GPU
Model
Deployment
🤖
AI Research
arxiv.org
·
2d
CUDA
13.3 Lands,
AI
Writes Blackwell Kernels, & FP4
VRAM
Optimization for LLMs
⚡
Performance Engineering
dev.to
·
12h
·
DEV
Show HN: cuSBF – Faster
GPU
Bloom Filter for Sequence Data
⚡
Performance Engineering
github.com
·
16h
·
Hacker News
Build
high-performance
generative
AI
systems with Strands Agents,
NVIDIA
NIM, and Amazon Bedrock AgentCore
🕵️
AI Agents
aws.amazon.com
·
1d
I Made Local
AI
Faster Than the Cloud — A Complete Home Automation Voice Control Journey
⚡
Performance Engineering
linkedin.com
·
50m
·
DEV
Running Flux Schnell (12B) + LLMs on a Legacy
AMD
RX 580 (8GB) via Native Vulkan — Full Architecture Guide [2026]
⚡
Performance Engineering
setup-ia-local-rx580-vulkan.firebaseapp.com
·
5d
·
DEV
Dense vs MoE
Models
Explained
🧠
LLMs
engineersmeetai.substack.com
·
22h
·
Substack
AI
Datacenters Were Built for GPUs. What Happens When You Remove the GPUs?
🏗️
System Design
almartis.xyz
·
2d
·
Hacker News
Argonne flexes spare supercompute to build private
AI
inference
service
🤖
AI Research
theregister.com
·
13h
·
Hacker News
AI
Infrastructure Preflight at User space: Validating Multi Node, Multi
GPU
Slurm
Clusters
⚡
Performance Engineering
techcommunity.microsoft.com
·
5d
Not All On-Device
AI
Is The Same: How Chip Compute Tiers Decide What Your Product Can Actually Do
🔌
Embedded Systems
easelinktech.com
·
2d
·
Hacker News
The future of
AI
is an
AI
futures market
🤖
AI Research
semafor.com
·
1d
·
Hacker News
Nvidia
bets $150B on Taiwan as Trump's plan to make US an
AI
hub backfires
⚖️
Tech Policy
arstechnica.com
·
13h
Getting Started with Slinky on DigitalOcean Kubernetes
☁️
Cloud Computing
digitalocean.com
·
6d
Presentation: Designing
AI
Platforms for Reliability: Tools for Certainty, Agents for Discovery
🤖
AI Research
infoq.com
·
1d
NVIDIA
Removes Gaming Revenue Category From Financial Reports
⚖️
Tech Policy
guru3d.com
·
6d
·
Hacker News
,
r/LocalLLaMA
The Download: keeping up with
AI
, and the future of IVF
🤖
AI Research
technologyreview.com
·
16h
·
Hacker News
The Open/Closed Problem in
AI
🤖
AI Research
blog.mempko.com
·
5d
·
Lobsters
,
Hacker News
openbmb/MiniCPM5-1B
🧠
LLMs
huggingface.co
·
2d
·
r/LocalLLaMA
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help