The Real Cost of LLM Inference: Memory Bandwidth, Not FLOPs
dev.to·1d·
Discuss: DEV
💾Cache-Oblivious Algorithms
September 2024 Progress in Guaranteed Safe AI
lesswrong.com·2d
Standard ML
10000
jro.sg·18h
📦Executable Size
How LLM Inference Works
arpitbhayani.me·1d
🚀Tokenizer Performance
Discovering physical laws with parallel symbolic enumeration
nature.com·1d
ML Language
Why DETRs are replacing YOLOs for real-time object detection
blog.datameister.ai·19h·
Discuss: Hacker News
✨Effect Inference
Arc Is a Vision Problem
arxiviq.substack.com·17h·
Discuss: Substack
🌱Minimal ML
The Machine Learning Roadmap
github.com·8h·
Discuss: Hacker News
🌱Minimal ML
Multi-Core Architecture Optimized For Time-Predictable Neural Network Inference (FZI, KIT)
semiengineering.com·1d
🔮CPU Branch Prediction
Fine-tuning & RAG Strategy for Academic Research (I Need a Sanity Check on Model Choice)
reddit.com·19h·
Discuss: r/LLM
📡Erlang BEAM
Zoomer: Powering AI Performance at Meta’s Scale Through Intelligent Debugging and Optimization
engineering.fb.com·1d
📈Performance Tools
The Engineering Guide to Efficient LLM Inference: Metrics, Memory, and Mathematics
pub.towardsai.net·2d
⚡Tokenizer Optimization
An overview of memory management in Go (2021)
medium.com·13h·
Discuss: Hacker News
📚Stack Data Structures
Apple Machine Learning Research at NeurIPS 2025
machinelearning.apple.com·2d
✨Effect Inference
On Thread Synchronization: Part 1 - A deep dive into mutexes
sayujya-apte.github.io·20h·
Discuss: r/programming
🔗Concurrency Primitives
AMS-KV: Adaptive KV Caching in Multi-Scale Visual Autoregressive Transformers
arxiv.org·2d
📋JSON Parsing
Trying Out C++26 Executors
mropert.github.io·11h·
🔮Speculative Execution
Automated High-Throughput Functional Protein Screening via Graph-Neural Network Enhanced Microfluidics
dev.to·3h·
Discuss: DEV
📋JSON Parsing