Disaggregated Inference at Scale with PyTorch and VLLM
pytorch.org·22h·
Discuss: Hacker News
🧠LLM Inference
Retro x86 with 486Tang
hackaday.com·6h
🔐Hardware Security
[CS 2881r AI Safety] [Week 1] Introduction
lesswrong.com·6h
🛡️AI Safety
Select Qualcomm X Elite Laptops Seeing IRIS Video Acceleration On Linux
phoronix.com·16h·
Discuss: r/linux
🧰Framework
Refurb weekend: Silicon Graphics Indigo² IMPACT 10000
oldvcr.blogspot.com·21h·
⚙️Mechanical Sympathy
New Light-Based Chip Supercharges AI Efficiency by up to 100x
scitechdaily.com·12h
🖥GPUs
vLLM on consumer grade Blackwell with NVFP4 models - anyone actually managed to run these?
reddit.com·13h·
Discuss: r/LocalLLaMA
📊Model Serving Economics
Nvidia rumored to ditch its first-gen custom memory form factor for newer version — SOCAMM1 for faster ‘SOCAMM2’ standard
tomshardware.com·13h
🖥GPUs
I built an LLM from Scratch in Rust (Just ndarray and rand)
github.com·10h·
Discuss: r/rust
🕯️Candle
How Quantized Models Are Making AI Faster on Mobile
lktechacademy.com·13h·
Discuss: r/LocalLLaMA
🧠LLM Inference
Machines of Loving Grace
darioamodei.com·9h·
Discuss: Hacker News
🆕New AI
Cyberport may use Chinese GPUs at supercomputing centre, CEO says
scmp.com·17h·
Discuss: r/SCMPauto
🖥GPUs
Rigetti Computing Gets Closer To Crucial Quantum Milestone (Rating Upgrade)
seekingalpha.com·21h
🏆Ranking
Anthropic markets revolutionary AI agent technology as mundane spreadsheet tools
nearlyright.com·6h
🎭Claude
The pirate-based logic of Rust shared references
ais523.me.uk·8h
🦀Rust Compiler Internals
The MOTIF Hand: A tool advancing the capabilities of previous robot hand technology
techxplore.com·12h
🤖Home Assistant
Deep Dive into SATA, USB and PCI Express on AMD Turin
blog.3mdeb.com·5h·
Discuss: Hacker News
🔐Hardware Security
Trigger crossbar
serd.es·8h·
Discuss: Hacker News
🔐Hardware Security
Show HN: Free Volume Shader BM – a browser-based GPU performance benchmark
volumeshaderbm.org·13h·
Discuss: Hacker News
🖥GPUs