How LLM Inference Works
arpitbhayani.me·1d
🚀Tokenizer Performance
Flag this post
Learning AI From Scratch: Streaming Output, the Secret Sauce Behind Real-Time LLMs
🌊Streaming Lexers
Flag this post
Rad-tolerant MCUs cut space-grade costs
edn.com·2d
🔧RISC-V
Flag this post
Thoughts on the GigaOm Radar for Vector Databases v3
thenewstack.io·1d
🔍Query Engines
Flag this post
Building Tornago: A Go Library for Tor Integration Born from Fraud Prevention Needs
🌍Minimal HTTP
Flag this post
Running a 270M LLM on Android for Offline News Summarization (Notes and Code)
💬Smalltalk VMs
Flag this post
LAW-N: The Network Layer for Mind's Eye ## A Research-Backed Thesis on Context-Aware Data Movement for Mobile Cognitive Systems
📡Protocol Stacks
Flag this post
which one would you pick?
🖥️Minimal VMs
Flag this post
Parameterized complexity of scheduling unit-time jobs with generalized precedence constraints
arxiv.org·5d
⚡Partial Evaluation
Flag this post
[Product] I built 4 tools to supercharge Claude - Now available for purchase (code search, extended reasoning, more)
🌪️V8 Pipeline
Flag this post
I run my home lab 24/7, and I haven't gone bankrupt thanks to these 4 tweaks
xda-developers.com·17h
📊perf Tools
Flag this post
Zig in 30 Minutes
🦀MIR Optimization
Flag this post
Loading...Loading more...