Language Models Pack Billions of Concepts into 12,000 Dimensions
nickyoder.com·14h·
📊Embeddings
Semantic Dictionary Encoding
falvotech.com·4h·
Discuss: Hacker News
💾Binary Formats
LAVa: Layer-wise KV Cache Eviction with Dynamic Budget Allocation
arxiv.org·14h
🧠LLM Inference
Analog IMC Attention Mechanism For Fast And Energy-Efficient LLMs (FZJ, RWTH Aachen)
semiengineering.com·2h
🧠LLM Inference
Basic Guide to Einsum
ajcr.net·23h·
Discuss: Hacker News
🔄SIMD Programming
Baking with Rails at scale: recipes in Ruby, cookware from Go, C, and Rust
evilmartians.com·18h
🏹Apache Arrow
Conquering the LLM Memory Wall: How to Run 2–4x Longer Contexts with a Single Line of Code
reddit.com·7h·
Discuss: r/LocalLLaMA
🧠LLM Inference
Mathematics Discovering Its Consciousness: Lasso Estimator as Cosmic Backdoor
zakelfassi.com·20h·
Discuss: Hacker News
🧠LLM Inference
UTF-8 Is Beautiful
hackaday.com·13h
📋Markdown
Crashes are loud. Leaks are quiet.
blog.bitdrift.io·18h
💾Persistence Strategies
What is Algebraic about Algebraic Effects?
interjectedfuture.com·2h
💻Programming languages
Balance between refactoring and inheritance in your code
github.com·6h·
Discuss: Hacker News
🪄Prompt Engineering
Vibe Check: GPT-5 Codex Can Code for 35 Minutes Straight—If You Ask Nicely
kill-the-newsletter.com·1h
🪄Prompt Engineering
[CS 2881r AI Safety] [Week 1] Introduction
lesswrong.com·22h
🛡️AI Safety
More hardware won’t fix bad engineering
infoworld.com·9h
⚙️Mechanical Sympathy
CoDiCodec: Unifying Continuous and Discrete Compressed Representations of Audio
arxiv.org·14h
🗜️Zstd
Why machines struggle with the unknown: Exploring the gap in human and AI learning
techxplore.com·5h
🆕New AI
A Dumb Introduction to z3. Exploring the world of constraint solvers with very simple examples.
asibahi.github.io·21h·
🧮SMT Solvers
Filtering After Shading with Stochastic Texture Filtering
research.nvidia.com·24m·
Discuss: Hacker News
🔤Font Rendering