Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
FP8 Training
🔢 FP8 Training
Specific
FP8, float8, mixed precision, H100 transformer engine
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
35
posts in
14.6
ms
"North Mini Code"; open weights, 30B param, Canadian coding model
⏱️
Prefill Decoding
Content type:
Blog
cohere.com
·
1d
1 day ago
·
Hacker News
Actions for "North Mini Code"; open weights, 30B param, Canadian coding model
heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.
💾
KV Cache
Content type:
Code
github.com
·
3d
3 days ago
·
r/LocalLLaMA
Actions for heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.
P-Cast
Precision
in
FP8
Attention: Sink-Induced Collapse and the Optimality of S=2^8
💰
Inference Cost
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for P-Cast Precision in FP8 Attention: Sink-Induced Collapse and the Optimality of S=2^8
Show HN: AutoGPU – AI designs a real 7nm GPU, from Verilog to GDSII
🧠
HBM Bandwidth
Content type:
Code
github.com
·
1d
1 day ago
·
Hacker News
Actions for Show HN: AutoGPU – AI designs a real 7nm GPU, from Verilog to GDSII
DJI 20260603031949 0009 D CHAR withad ARTOP 500 Local LLM Motherboard with GRX50 and RTX 5090
🧠
HBM Bandwidth
armdevices.net
·
6d
6 days ago
Actions for DJI 20260603031949 0009 D CHAR withad ARTOP 500 Local LLM Motherboard with GRX50 and RTX 5090
harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
💾
KV Cache
Content type:
Code
github.com
·
4d
4 days ago
·
Hacker News
Actions for harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
SpenseGPT: Practical One-shot Pruning Enabling Sparse and Dense GEMMs for LLM Inference
💰
Inference Cost
Content type:
Academic
arxiv.org
·
15h
15 hours ago
Actions for SpenseGPT: Practical One-shot Pruning Enabling Sparse and Dense GEMMs for LLM Inference
Intel's mysterious new datacenter GPU is what
Nvidia
's Rubin CPX nearly was
🟢
CUDA
theregister.com
·
6d
6 days ago
Actions for Intel's mysterious new datacenter GPU is what Nvidia's Rubin CPX nearly was
How the UK Is Turning Sovereign AI Ambition Into Action With
NVIDIA
Technologies
🧠
Inference Engineering
Content type:
Blog
blogs.nvidia.com
·
2d
2 days ago
Actions for How the UK Is Turning Sovereign AI Ambition Into Action With NVIDIA Technologies
Forlinx
launches
Rockchip RK3572 system-on-module (SoM) and development board with Linux 6.12 BSP - CNX Software
🪄
Chiplet Design
cnx-software.com
·
6d
6 days ago
Actions for Forlinx launches Rockchip RK3572 system-on-module (SoM) and development board with Linux 6.12 BSP - CNX Software
Ablation Study of Block Size, Weight
Precision
, and Scale
Precision
in NVFP4 Inference for Low-Power Edge-Efficient Neural Networks
🚀
Speculative Decoding
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Ablation Study of Block Size, Weight Precision, and Scale Precision in NVFP4 Inference for Low-Power Edge-Efficient Neural Networks
FSR 4.1 made older Radeon cards interesting again, but not for the reason AMD wants
🎮
GPU Computing
xda-developers.com
·
6d
6 days ago
Actions for FSR 4.1 made older Radeon cards interesting again, but not for the reason AMD wants
Build a local voice agent with Red Hat OpenShift AI
🎮
GPU Computing
developers.redhat.com
·
2d
2 days ago
Actions for Build a local voice agent with Red Hat OpenShift AI
Does anyone know what PCIe mode was used for these benchmarks?
💾
KV Cache
Content type:
Code
github.com
·
4d
4 days ago
·
r/LocalLLaMA
Actions for Does anyone know what PCIe mode was used for these benchmarks?
[AINews] not much happened today
💰
Inference Cost
Content type:
News
latent.space
·
4d
4 days ago
Actions for [AINews] not much happened today
« Page 1
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help