Skip to main content
Scour
Discover
Docs
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
ML Hardware
🔲 ML Hardware
GPU, TPU, inference hardware, AI accelerators, CUDA
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
81
posts in
29.3
ms
🤖
AI Research
DEV Community
·
4d
4 days ago
TPUs vs GPUs: How Google's
Tensor
Processing
Units
Actually Work
Discussed on
DEV
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for TPUs vs GPUs: How Google's Tensor Processing Units Actually Work
🔌
Embedded Systems
nvidia.github.io
·
1d
1 day ago
Zero-Copy Data Movement from NIC to
GPU
at 100s of Gbps
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Zero-Copy Data Movement from NIC to GPU at 100s of Gbps
🍎
Apple
GitHub
·
2h
2 hours ago
Show HN: Navatala
GPU
– multi-back end
GPU
kernels and Python bindings
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Navatala GPU – multi-back end GPU kernels and Python bindings
📐
Systems Design
AWS
·
1h
1 hour ago
Optimize
model
training
on Amazon SageMaker
AI
with NVIDIA Blackwell
Covers
NVIDIA Blackwell Architecture
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Optimize model training on Amazon SageMaker AI with NVIDIA Blackwell
🤖
LLM
HackerNoon
·
1d
1 day ago
Before the First Gradient: The Hidden Machinery Behind LLM
Training
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Before the First Gradient: The Hidden Machinery Behind LLM Training
🏗️
System Design
hardware.slashdot.org
·
21h
21 hours ago
OpenAI Unveils First Chip As
Part
of Broadcom Deal
Covered by
kite.kagi.com
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for OpenAI Unveils First Chip As Part of Broadcom Deal
🤖
LLM
arXiv
·
2d
2 days ago
An Empirical Study of OpenPangu Quantization on Ascend NPUs
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for An Empirical Study of OpenPangu Quantization on Ascend NPUs
💬
NLP
kaggle.com
·
4d
4 days ago
LoRA: I
Trained
<1% of a 1.5B
Model
and Matched a Full Fine-Tune
Discussed on
DEV
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for LoRA: I Trained <1% of a 1.5B Model and Matched a Full Fine-Tune
🤖
LLM
supercomputing-system-ai-lab.github.io
·
1d
1 day ago
VoltanaLLM: Energy-Efficient LLM Serving
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for VoltanaLLM: Energy-Efficient LLM Serving
🤖
LLM
Engadget
·
21h
21 hours ago
Jalapeño is the first
AI
chip from OpenAI and Broadcom
Covered by
kite.kagi.com
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Jalapeño is the first AI chip from OpenAI and Broadcom
🤖
LLM
GitHub
·
5d
5 days ago
Show HN: NanoEuler – GPT-2 scale
model
in pure
C/CUDA
from scratch
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: NanoEuler – GPT-2 scale model in pure C/CUDA from scratch
☁️
Cloud Computing
modelplane.ai
·
2d
2 days ago
Modelplane
Covered by
The Crossplane Blog
Discussed on
Hacker News
and
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Modelplane
🪟
Context Windows
unsloth.ai
·
6d
6 days ago
GLM-5.2 – How to Run Locally
Covers
2 stories
See all stories this covers
including
GitHub here . You can follow the build instructions below as well. Change -DGGML_CUDA=ON to -DGGML_CUDA=OFF if you don't have a GPU or just want CPU inferen...
Covered by
6 sources
See all sources covering this story
including
tldr.tech
,
daemonology.net
Discussed on
Hacker News
and
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for GLM-5.2 – How to Run Locally
🏢
Tech Industry
WIRED
·
1d
1 day ago
Qualcomm Buys Buzzy Chip Startup Modular for Nearly $4 Billion
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Qualcomm Buys Buzzy Chip Startup Modular for Nearly $4 Billion
🤖
LLM
Hacker News
·
1d
1 day ago
Ask HN: Are people generally interested using LLMs for learning purposes?
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Ask HN: Are people generally interested using LLMs for learning purposes?
🏢
Tech Industry
DEV Community
·
3d
3 days ago
When GPUs Are Scarce, Each Stall Costs N Times More
Covers
Anthropic in talks with investors to raise funds at $900 billion valuation, higher than OpenAI
Discussed on
DEV
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for When GPUs Are Scarce, Each Stall Costs N Times More
🤖
LLM
GitHub
·
2d
2 days ago
100+ t/s on Qwen3.6-27B Q8 across a 5090 + 3090 Ti — switching to
tensor
split-mode
got me from 70 to 100+
Covered by
NVIDIA Technical Blog
,
imil.net
Discussed on
r/LocalLLaMA
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for 100+ t/s on Qwen3.6-27B Q8 across a 5090 + 3090 Ti — switching to tensor split-mode got me from 70 to 100+
🤖
LLM
The New Stack
·
3d
3 days ago
'"An LLM and a harness":
Nvidia
''s simple thesis on what agents actually are'
Covered by
GitHub
Discussed on
DEV
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for '"An LLM and a harness": Nvidia''s simple thesis on what agents actually are'
🏢
Tech Industry
TechCrunch
·
2d
2 days ago
AI
chipmaker Groq confirms $650M raise, re-staffs after
Nvidia
’s $20B not-acqui-hire deal
Covers
Groq Raises Another $650M
Covered by
Forward Future
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for AI chipmaker Groq confirms $650M raise, re-staffs after Nvidia’s $20B not-acqui-hire deal
🔌
Embedded Systems
CNET
·
1h
1 hour ago
IBM's New Chip Fits Nearly 100 Billion Transistors in the Size of a Fingernail
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for IBM's New Chip Fits Nearly 100 Billion Transistors in the Size of a Fingernail
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report