OBCache: Optimal Brain KV Cache Pruning for Efficient Long-Context LLM Inference
arxiv.orgยท19h
๐Ÿง LLM Inference
Explicit Lossless Vertex Expanders!
gilkalai.wordpress.comยท13h
๐Ÿ”ฌRaBitQ
Show HN: Rebuilt Bible search app to run 100% client-side with Transformers.js
biblos.appยท2hยท
Discuss: Hacker News
๐Ÿš€LanceDB
From Text to Token: How Tokenization Pipelines Work
paradedb.comยท23h
๐Ÿ”คTokenization
The DINOv3 Playbook for Computer Vision Data Science
pub.towardsai.netยท10h
๐Ÿ“ŠVector Databases
YouTube gets ~5% CTR lift on Shorts by replacing embedding tables with Semantic IDs
shaped.aiยท23h
๐Ÿ“ŠFeed Optimization
GoMem is a high-performance memory allocator library for Go
github.comยท21h
๐Ÿง Memory Allocators
(Forward) automatic implicit differentiation in Rust with num-dual 0.12.0
reddit.comยท8hยท
Discuss: r/rust
๐ŸŽญRust Macros
Scaling Time-Series Data for AI Models
singlestore.comยท8h
๐ŸŽ›๏ธFeed Filtering
BQN "Macros" with โ€ขDecompose (2023)
saltysylvi.github.ioยท1hยท
Discuss: Hacker News
๐ŸŽญRust Macros
Iterated Development and Study of Schemers (IDSS)
lesswrong.comยท9h
๐Ÿ†•New AI
Personal Knowledge Management Systems & Digital Gardens
lavenderlit.bearblog.devยท17h
๐Ÿ”ŽInverted Index
MultiPar 1.3.3.5 Beta / 1.3.2.9
majorgeeks.comยท15h
๐Ÿ“„File Formats
NExF: Learning Neural Exposure Fields for View Synthesis
m-niemeyer.github.ioยท16hยท
Discuss: Hacker News
๐Ÿ—๏ธLLM Infrastructure
Your crawl budget is costing you revenue in the AI search era by Semrush Enterprise
searchengineland.comยท12h
๐Ÿ’ณContent Monetization
FFTY: Spectacular Returns In 2025 Mask Issues
seekingalpha.comยท17h
๐ŸฏTigerBeetle
Nearest Neighbor CCP-Based Molecular Sequence Analysis
arxiv.orgยท19h
๐Ÿ”Vector Search Algorithms