LLM Optimization Notes: Memory, Compute and Inference Techniques
gaurigupta19.github.ioยท9hยท
Discuss: Hacker News
๐Ÿ’ปLocal LLMs
Learning to Route: A Rule-Driven Agent Framework for Hybrid-Source Retrieval-Augmented Generation
arxiv.orgยท21h
๐Ÿ”Information Retrieval
Troops Head to Chicago, Despite Efforts to Block Them
nytimes.comยท3h
๐Ÿ“กRSS
Chat Smarter, Not Harder: Building an AI Chat Interface in Your Angular App
dev.toยท3hยท
Discuss: DEV
๐Ÿ”ŒInterface Evolution
Beating the L1 cache with value speculation (2021)
mazzo.liยท9hยท
Discuss: Lobsters
โšกCPU Microarchitecture
Is Odin Just a More Boring C?
dayvster.comยท14hยท
Discuss: Hacker News
๐Ÿ”ฉSystems Programming
Achieving 1.2 TB/s Aggregate Bandwidth by Optimizing Distributed Cache Network
juicefs.comยท1dยท
Discuss: Hacker News
๐Ÿ“กNetwork Stack
Eliminating the Precisionโ€“Latency Trade-Off in Large-Scale RAG
thenewstack.ioยท3d
๐ŸŽฏRetrieval Systems
LibreQoS v1.5-RC-2
libreqos.ioยท4h
๐Ÿ”ŒInterface Evolution
Introducing OpenZL: An Open Source Format-Aware Compression Framework
engineering.fb.comยท9hยท
โšกModern Compression
A Global Mining Dataset
tech.marksblogg.comยท14hยท
Discuss: Hacker News
๐Ÿ“ฆMETS Containers
Token economics are serious AI business; API costs are out of control
theservitor.comยท16hยท
Discuss: Hacker News
๐ŸŒ€Brotli Internals
Self-Extracting F3
buttondown.comยท7hยท
Discuss: Hacker News
โœ…Format Verification
Measuring scaleup for Postgres 18.0 with sysbench
smalldatum.blogspot.comยท1dยท
๐Ÿ“ŠPerformance Profiling
Database Transactions: Everything That Can Go Wrong When Using Them
hackernoon.comยท3h
๐Ÿ“Database WAL
Show HN: I Built a Transcription CLI Because Uploading 4GB Videos Was Killing Me
medium.comยท6hยท
Discuss: Hacker News
๐Ÿ’ฟFLAC Archaeology
Krish Naik: Complete RAG Crash Course With Langchain In 2 Hours
dev.toยท5hยท
Discuss: DEV
๐Ÿ“ŠMulti-vector RAG
Property-based testing of batch-invariant operations
mmaaz.caยท1dยท
Discuss: Hacker News
๐ŸงชProperty-Based Testing
We built an open source SLURM replacement for ML training workloads built on SkyPilot, Ray and K8s.
reddit.comยท6hยท
Discuss: r/kubernetes
๐ŸŒŠStreaming Systems
An Overview of Modern Memory Management Architectures in LLM Agents
vinithavn.medium.comยท1dยท
Discuss: Hacker News
๐Ÿ’พPersistence Strategies