Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
NVIDIA
🟢 NVIDIA
Specific
GPU, CUDA, NVIDIA hardware, graphics cards
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
77
posts in
9.0
ms
NVIDIA
Accelerates Google DeepMind’s DiffusionGemma for Local AI
🤗
Open Source AI
Content type:
Blog
blogs.nvidia.com
·
7h
7 hours ago
Actions for NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI
KJLdefeated/RL.cu
: RLVR training for LLM in CUDA/C++
🧠
LLMs
Content type:
Code
github.com
·
3d
3 days ago
·
Hacker News
Actions for KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++
1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM
🧠
LLMs
smolhub.com
·
2d
2 days ago
·
r/LocalLLaMA
Actions for 1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM
DiffusionGemma: 4x Faster Text Generation
🤗
Open Source AI
Content type:
News
Content type:
Blog
blog.google
·
7h
7 hours ago
·
Hacker News
,
r/LocalLLaMA
,
r/singularity
Actions for DiffusionGemma: 4x Faster Text Generation
AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis
🤗
Hugging Face
Content type:
Academic
arxiv.org
·
1d
1 day ago
·
Hacker News
Actions for AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis
Luce Spark: a 35B MoE on a 16 GB
GPU
, without the offload tax
🏠
Local LLMs
Content type:
Blog
lucebox.com
·
5d
5 days ago
·
Hacker News
Actions for Luce Spark: a 35B MoE on a 16 GB GPU, without the offload tax
Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent
🔬
ML Research
Content type:
News
spectrum.ieee.org
·
12h
12 hours ago
·
Hacker News
Actions for Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent
Less-relevant results
Show HN: Monitoring Confidential Inference Providers
🔶
Cloudflare
Content type:
Discussion
confidentialinference.net
·
2d
2 days ago
·
Hacker News
Actions for Show HN: Monitoring Confidential Inference Providers
🫧 AI Companies' Shared Destiny Recalls Dot-Com Bubble Memories
📈
AI Industry
Content type:
Discussion
bullbear.ninja
·
3d
3 days ago
·
Hacker News
Actions for 🫧 AI Companies' Shared Destiny Recalls Dot-Com Bubble Memories
DiffusionGemma: The Developer Guide- Google Developers Blog
🧠
LLMs
Content type:
Blog
developers.googleblog.com
·
23h
23 hours ago
·
r/LocalLLaMA
Actions for DiffusionGemma: The Developer Guide- Google Developers Blog
Upstart chipmakers keep challenging
Nvidia
. This time it's Microsoft-backed D-Matrix
💻
Tech Industry
Content type:
News
cnbc.com
·
1d
1 day ago
·
Hacker News
Actions for Upstart chipmakers keep challenging Nvidia. This time it's Microsoft-backed D-Matrix
Running Qwen 35B MoE at 450k Context on a Single 32GB
GPU
🏠
Local LLMs
local-llm.utop.workers.dev
·
3d
3 days ago
·
Hacker News
Actions for Running Qwen 35B MoE at 450k Context on a Single 32GB GPU
Apple WWDC On-Device AI Deep Dive - Google Docs
🧠
LLMs
gist.is
·
1h
1 hour ago
·
Hacker News
Actions for Apple WWDC On-Device AI Deep Dive - Google Docs
Google Will Pay SpaceX $920 Million Per Month For Compute - Slashdot
💻
Tech Industry
hardware.slashdot.org
·
5d
5 days ago
Actions for Google Will Pay SpaceX $920 Million Per Month For Compute - Slashdot
Remove padding and multiple D2D copies for MTP by
gaugarg-nv
· Pull Request #24086 · ggml-org/llama.cpp
⚙️
DevOps
Content type:
Code
github.com
·
5h
5 hours ago
·
r/LocalLLaMA
Actions for Remove padding and multiple D2D copies for MTP by gaugarg-nv · Pull Request #24086 · ggml-org/llama.cpp
NVIDIA
Confidential Computing to Help Expand Apple’s Private Cloud Compute
🎵
Vibe Coding
Content type:
Blog
blogs.nvidia.com
·
1d
1 day ago
Actions for NVIDIA Confidential Computing to Help Expand Apple’s Private Cloud Compute
The Death of the App: Why Jensen Huang Just Blew Up the 40-Year-Old PC Bargain
🏗️
Software Architecture
Content type:
Blog
medium.com
·
5d
5 days ago
Actions for The Death of the App: Why Jensen Huang Just Blew Up the 40-Year-Old PC Bargain
Apple's New AI Models Contain 'None' of Google's Gemini Assistant
🤗
Open Source AI
Content type:
News
macrumors.com
·
1d
1 day ago
·
Hacker News
Actions for Apple's New AI Models Contain 'None' of Google's Gemini Assistant
GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)
🏠
Local LLMs
vettedconsumer.com
·
4d
4 days ago
·
Hacker News
Actions for GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)
DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
🧠
LLMs
Content type:
News
newsletter.semianalysis.com
·
1d
1 day ago
·
Hacker News
Actions for DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help