Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
NVIDIA
🟢 NVIDIA
Specific
GPU, CUDA, NVIDIA hardware, graphics cards
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
76
posts in
16.3
ms
DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
🧠
LLMs
Content type:
News
newsletter.semianalysis.com
·
1d
1 day ago
·
Hacker News
Actions for DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
Release TorchCodec 0.14: HDR Video Decoding for CPU &
CUDA
, and Fast Wav Decoder · meta-pytorch/torchcodec
🎵
Vibe Coding
Content type:
Code
github.com
·
10h
10 hours ago
·
Hacker News
Actions for Release TorchCodec 0.14: HDR Video Decoding for CPU & CUDA, and Fast Wav Decoder · meta-pytorch/torchcodec
Expanding Private Cloud Compute - Apple Security Research
☁️
Cloud Computing
Content type:
Blog
security.apple.com
·
2d
2 days ago
·
Lobsters
,
Hacker News
,
r/apple
Actions for Expanding Private Cloud Compute - Apple Security Research
Train Models Faster with JAX and MaxText Using NVFP4 on
NVIDIA
Blackwell
🏠
Local LLMs
Content type:
News
Content type:
Blog
developer.nvidia.com
·
2d
2 days ago
Actions for Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell
NVIDIA
, KRAFTON, NC and Reigning ‘League of Legends’ Champions T1 Celebrate
RTX
Spark at Korea’s PC Bangs
🧠
Transformers
Content type:
Blog
blogs.nvidia.com
·
3d
3 days ago
Actions for NVIDIA, KRAFTON, NC and Reigning ‘League of Legends’ Champions T1 Celebrate RTX Spark at Korea’s PC Bangs
Apple rebuilt its on-device AI stack at WWDC 2026
🛠️
Developer Tools
Content type:
Blog
ziraph.com
·
1d
1 day ago
·
Hacker News
Actions for Apple rebuilt its on-device AI stack at WWDC 2026
CodegenBench: Can LLMs Write Efficient Code Across Architectures?
💻
Code Generation
Content type:
Academic
arxiv.org
·
6d
6 days ago
·
Hacker News
Actions for CodegenBench: Can LLMs Write Efficient Code Across Architectures?
KaiFelixBennett/gemma4-turboquant-rdna4: Run Gemma-4-31B at full 256K context on a $1,400 AMD RDNA4
GPU
(gfx1201): TurboQuant KV cache +
HIP-graph-safe
Flash-Attention for llama.cpp, fully measured on real
hardware
.
🤗
Open Source AI
Content type:
Code
github.com
·
9h
9 hours ago
·
Hacker News
Actions for KaiFelixBennett/gemma4-turboquant-rdna4: Run Gemma-4-31B at full 256K context on a $1,400 AMD RDNA4 GPU (gfx1201): TurboQuant KV cache + HIP-graph-safe Flash-Attention for llama.cpp, fully measured on real hardware.
Microsoft continues its big Linux push at Build 2026
☁️
Cloud Computing
zdnet.com
·
6d
6 days ago
·
Hacker News
Actions for Microsoft continues its big Linux push at Build 2026
Why Compiler Engineers Rarely Use Strassen's Algorithm for Fast Matrix Multiplications
🏗️
Software Architecture
Content type:
News
Content type:
Blog
leetarxiv.substack.com
·
2d
2 days ago
·
Substack
,
r/programming
Actions for Why Compiler Engineers Rarely Use Strassen's Algorithm for Fast Matrix Multiplications
On-device AI is a margin decision
🏠
Local LLMs
Content type:
Blog
ziraph.com
·
7h
7 hours ago
·
Hacker News
Actions for On-device AI is a margin decision
Fine-tune FLUX.2 [Klein] with a LoRA under 60 minutes
🤗
Open Source AI
Content type:
Blog
huggingface.co
·
6d
6 days ago
·
Hacker News
Actions for Fine-tune FLUX.2 [Klein] with a LoRA under 60 minutes
Huawei-led team claims it post-trained DeepSeek's 1.6-trillion-parameter model — 1,000 Ascend 910C chips used in training
🤗
Open Source AI
Content type:
News
tomshardware.com
·
4d
4 days ago
·
Hacker News
Actions for Huawei-led team claims it post-trained DeepSeek's 1.6-trillion-parameter model — 1,000 Ascend 910C chips used in training
NVIDIA
and LG Group Build an AI Factory to Advance Physical AI, Mobility and AI Infrastructure
🏗️
Software Architecture
Content type:
Blog
blogs.nvidia.com
·
2d
2 days ago
·
Hacker News
Actions for NVIDIA and LG Group Build an AI Factory to Advance Physical AI, Mobility and AI Infrastructure
Ideogram-4-FP8 Brings High-Quality Text-to-Image Generation to More
Hardware
✍️
Prompt Engineering
hackernoon.com
·
5d
5 days ago
Actions for Ideogram-4-FP8 Brings High-Quality Text-to-Image Generation to More Hardware
The Download: how the World Cup ball will fly and OpenAI’s “super app”
💻
Tech Industry
Content type:
News
technologyreview.com
·
2d
2 days ago
·
Hacker News
Actions for The Download: how the World Cup ball will fly and OpenAI’s “super app”
Apple Silicon's on-device AI bet hasn't moved – only the chip range that runs it
💻
Tech Industry
tbreak.com
·
5d
5 days ago
·
Hacker News
,
r/apple
Actions for Apple Silicon's on-device AI bet hasn't moved – only the chip range that runs it
Unpacking AI: The
Hardware
Behind AI
🕵️
Agentic AI
Content type:
News
pathtostaff.com
·
4d
4 days ago
·
Hacker News
Actions for Unpacking AI: The Hardware Behind AI
Scarcity is driving AI innovation outside Silicon Valley
📈
AI Industry
restofworld.org
·
6d
6 days ago
·
Hacker News
Actions for Scarcity is driving AI innovation outside Silicon Valley
bigattichouse/packed-twin-inference: PTI achieves ~2× throughput using a single quantized model (Q5_K_M or better) by running 4 generation streams in one batched decode call. The
GPU
loads model weights once per step and produces 4 predictions simultaneously. KV cache overhead is ~0.8 GiB total for all 4 streams. No draft model. No quality loss
🏠
Local LLMs
Content type:
Code
github.com
·
1d
1 day ago
·
r/LocalLLaMA
Actions for bigattichouse/packed-twin-inference: PTI achieves ~2× throughput using a single quantized model (Q5_K_M or better) by running 4 generation streams in one batched decode call. The GPU loads model weights once per step and produces 4 predictions simultaneously. KV cache overhead is ~0.8 GiB total for all 4 streams. No draft model. No quality loss
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help