Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🔲 ML Hardware
GPU, TPU, inference hardware, AI accelerators, CUDA
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
11069
posts in
17.2
ms
An Efficient
Heterogeneous
Co-Design for
Fine-Tuning
on a Single GPU
arxiv.org
·
2d
⚡
Performance Engineering
Intrducing
momo-kiji
: CUDA for Apple Neural Engine
dev.to
·
1d
·
Discuss:
DEV
🤖
AI Research
Your GPU Is 97%
Utilized
But Your Training Is 3x
Slower
Than Expected
github.com
·
14h
·
Discuss:
DEV
⚡
Performance Engineering
LLM
Terminology
Guide:
Weights
, Inference, Effective sequence length, and Self-Hosting Explained
devforth.io
·
11h
·
Discuss:
Hacker News
🤖
LLM
Cost-Efficient Multimodal LLM Inference via
Cross-Tier
GPU
Heterogeneity
arxiv.org
·
4d
🤖
LLM
This video gives hope for unlocking the insane potential of the M5 GPU. It’s just that developers are only now being
provided
guides for optimizing
next-generat
...
youtube.com
·
3h
·
Discuss:
r/macgaming
⚡
Performance Engineering
Run any LLM on any hardware.
Auto-detects
your GPU, checks if the model
fits
github.com
·
2d
·
Discuss:
Hacker News
⚡
Performance Engineering
NumKong
: 2'000 Mixed Precision
Kernels
For All 🦍
ashvardanian.com
·
1d
·
Discuss:
Hacker News
🔌
Embedded Systems
FlexLink
: Boost GPU
Bandwidth
by 27% and Accelerate LLM Training by Unlocking Hidden Hardware Pathways
dev.to
·
1h
·
Discuss:
DEV
🏗️
System Design
solving y=mx+b... with
jax
on a
tpu
pod slice
matpalm.com
·
2d
🐦
Swift
I
trained
an anime image model in 2 days from
scratch
on 1 local GPU
huggingface.co
·
2d
·
Discuss:
r/StableDiffusion
👁️
Computer Vision
Scaling
Karpathy
's
Autoresearch
: What Happens When the Agent Gets a GPU Cluster
blog.skypilot.co
·
2d
·
Discuss:
Hacker News
,
r/vibecoding
🤖
AI Research
Show HN:
Clangd
for
CUDA
Device Code
docs.scale-lang.com
·
4d
·
Discuss:
Hacker News
⚡
Performance Engineering
Structured Resume Skill Extraction Using
Mistral-7B
Inference
digitalocean.com
·
1d
🧠
LLMs
Launch HN: Chamber (YC
W26
) – An AI
Teammate
for GPU Infrastructure
usechamber.io
·
4d
·
Discuss:
Hacker News
🤖
AI Research
Niv-AI
exits stealth to
wring
more power performance out of GPUs
techcrunch.com
·
3d
📐
Systems Design
Google AI Studio 2.0
producthunt.com
·
1d
🤖
AI Research
**Introducing SPEED-Bench: A Unified and
Diverse
Benchmark for
Speculative
Decoding**
huggingface.co
·
1d
·
Discuss:
Hacker News
⚡
Performance Engineering
Bayesian Neural Networks in {
tidymodels
} with {
kindling
}
r-bloggers.com
·
1d
🤖
LLM
Nvidia
greenboost
:
transparently
extend GPU VRAM using system RAM/NVMe
gitlab.com
·
2d
·
Discuss:
Lobsters
⚡
Performance Engineering
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help