FP8 Training

"North Mini Code"; open weights, 30B param, Canadian coding model

 ⏱️Prefill Decoding  Content type: Blog
cohere.com··Hacker News

heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.

 💾KV Cache  Content type: Code
github.com··r/LocalLLaMA

P-Cast Precision in FP8 Attention: Sink-Induced Collapse and the Optimality of S=2^8

 💰Inference Cost  Content type: Academic
arxiv.org·

Show HN: AutoGPU – AI designs a real 7nm GPU, from Verilog to GDSII

 🧠HBM Bandwidth  Content type: Code
github.com··Hacker News

DJI 20260603031949 0009 D CHAR withad ARTOP 500 Local LLM Motherboard with GRX50 and RTX 5090

 🧠HBM Bandwidth
armdevices.net·

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 💾KV Cache  Content type: Code
github.com··Hacker News

SpenseGPT: Practical One-shot Pruning Enabling Sparse and Dense GEMMs for LLM Inference

 💰Inference Cost  Content type: Academic
arxiv.org·

Intel's mysterious new datacenter GPU is what Nvidia's Rubin CPX nearly was

 🟢CUDA
theregister.com·

How the UK Is Turning Sovereign AI Ambition Into Action With NVIDIA Technologies

 🧠Inference Engineering  Content type: Blog
blogs.nvidia.com·

Forlinx launches Rockchip RK3572 system-on-module (SoM) and development board with Linux 6.12 BSP - CNX Software

 🪄Chiplet Design
cnx-software.com·

Ablation Study of Block Size, Weight Precision, and Scale Precision in NVFP4 Inference for Low-Power Edge-Efficient Neural Networks

 🚀Speculative Decoding  Content type: Academic
arxiv.org·

FSR 4.1 made older Radeon cards interesting again, but not for the reason AMD wants

 🎮GPU Computing
xda-developers.com·

Build a local voice agent with Red Hat OpenShift AI

 🎮GPU Computing
developers.redhat.com·

Does anyone know what PCIe mode was used for these benchmarks?

 💾KV Cache  Content type: Code
github.com··r/LocalLLaMA

[AINews] not much happened today

 💰Inference Cost  Content type: News
latent.space·
