Disaggregated Inference at Scale with PyTorch and VLLM
pytorch.org·17h·
Discuss: Hacker News
🧠LLM Inference
How Quantized Models Are Making AI Faster on Mobile
lktechacademy.com·7h·
Discuss: r/LocalLLaMA
🧠LLM Inference
I built an LLM from Scratch in Rust (Just ndarray and rand)
github.com·5h·
Discuss: r/rust
🕯️Candle
How to Train an LLM-Recommender Hybrid that Speaks English & Item IDs
eugeneyan.com·21h
🕸️Sparse Vectors
[CS 2881r AI Safety] [Week 1] Introduction
lesswrong.com·1h
🛡️AI Safety
AI Companies School Like Fish
dbreunig.com·23h·
Discuss: Hacker News
🚀Startups
A Dumb Introduction to z3. Exploring the world of constraint solvers with very simple examples.
asibahi.github.io·25m·
Discuss: r/programming
🧮SMT Solvers
Cross-Domain Misalignment Generalization: Role Inference vs. Weight Corruption
echoesofvastness.substack.com·22h·
🛡️AI Safety
Beyond Traditional Pseudorandomness, Tsotchkes' Quantum Random Number Generation
medium.com·21h·
Discuss: Hacker News
🎯Qdrant
original ↗
blog.djnavarro.net·21h
📑Inverted Indexes
Soil’s “dark” microbes open the door to future antibiotics
nature.com·5h
🍄Mycorrhizal Networks
New Light-Based Chip Supercharges AI Efficiency by up to 100x
scitechdaily.com·6h
🖥GPUs
🎲 How to Use KurrentDB for Event Sourcing in C# on Azure
nestenius.se·10h
🌫️Turso
H100 PCIe – 1.86 TB/s memcpy roofline and 8× uplift
news.ycombinator.com·20h·
Discuss: Hacker News
⚙️Mechanical Sympathy
How to Make a Simple MOSFET Tester
hackaday.com·22h
💻Chips
Calculus Made Easy by Silvanus P. Thompson
calculusmadeeasy.org·14h·
Discuss: Hacker News
💻Programming languages
🧵HOW BITCOIN WILL END EVERY WAR
threadreaderapp.com·1h
💵Economic Statecraft
Reaching Across the Isles: UK-LLM Brings AI to UK Languages With NVIDIA Nemotron
blogs.nvidia.com·20h
🤖AI
Cyberport may use Chinese GPUs at supercomputing centre, CEO says
scmp.com·12h·
Discuss: r/SCMPauto
🖥GPUs
Demis Hassabis en el podcast Release Notes
domingogallardo.bearblog.dev·3h
🆕New AI