Erase and Rewind: Precise LLM Memory Manipulation for Safer AI by Arvind Sundararajan
dev.to·2h·
Discuss: DEV
🧠Memory Ordering
Cost-Effective, Orthogonal Approach to Resilient Memory Design (Univ. of Central Florida, UT San Antonio, Rochester)
semiengineering.com·3h
🧠Memory Models
Semantic Dictionary Encoding
falvotech.com·20h·
Discuss: Hacker News
🗂️Type Indexing
AQUA: Attention via QUery mAgnitudes for Memory and Compute Efficient Inference in LLMs
arxiv.org·6h
🔍ML Language
Machine Scheduler in LLVM
myhsu.xyz·3h·
Discuss: Hacker News
📅Instruction Scheduling
What is Algebraic about Algebraic Effects?
interjectedfuture.com·18h
💫Effect Systems
Safepoints and Fil-C
fil-c.org·6h·
Discuss: Hacker News
🎯Ring Buffers
Ask HN: How can I test FTS5 engine in SQLite3?
news.ycombinator.com·1h·
Discuss: Hacker News
Performance
Rowhammer: TRR on DDR5 DRAM has been broken
comsec.ethz.ch·18h·
🏷️Memory Tagging
StringWa.rs on GPUs: Databases & Bioinformatics 🦠
ashvardanian.com·15h·
🚀Tokenizer Performance
Language Models Pack Billions of Concepts into 12,000 Dimensions
nickyoder.com·1d·
🌱Minimal ML
LLMs on a Shoestring: The Dynamic Cache Advantage by Arvind Sundararajan
dev.to·1d·
Discuss: DEV
💾Cache Algorithms
A Slotted Hash Cons for Alpha Invariance
philipzucker.com·16h·
Discuss: Hacker News
🔗Lexical Scoping
Zettelkasten for Programmers: Processing Swift Actor Usage Advice in Depth
christiantietze.de·4h
Gleam
Rendezvous Hashing Explained (2020)
randorithms.com·15h·
🔗Hash Algorithms
Verlog: A Multi-turn RL framework for LLM agents
blog.ml.cmu.edu·20h
🎭Erlang OTP
Conquering the LLM Memory Wall: How to Run 2–4x Longer Contexts with a Single Line of Code
reddit.com·23h·
Discuss: r/LocalLLaMA
🗺️Region Inference
Analog IMC Attention Mechanism For Fast And Energy-Efficient LLMs (FZJ, RWTH Aachen)
semiengineering.com·18h
🗺️Region Inference
Is Recursion in LLMs a Path to Efficiency and Quality?
pub.towardsai.net·10h
🪜Recursive Descent