Be Careful When Assigning ArenaAllocators (2024)
openmymind.net·3h
🧠Memory Allocation Strategies
Conquering the LLM Memory Wall: How to Run 2–4x Longer Contexts with a Single Line of Code
reddit.com·16h·Discuss: r/LocalLLaMA
🧠LLM Inference
More hardware won’t fix bad engineering
infoworld.com·19h
⚙️Mechanical Sympathy
Symmetric MultiProcessing, Hyper-Threading and scheduling on Maestro
blog.lenot.re·20h
🔄Cache Coherence
Balance between refactoring and inheritance in your code
github.com·16h·Discuss: Hacker News
🪄Prompt Engineering
What is Algebraic about Algebraic Effects?
interjectedfuture.com·12h
💻Programming languages
Building a Simple Stack-Based Virtual Machine in Go
blog.phakorn.com·21h
🧠Memory Hierarchy Design
PCSX2 2.4.0
majorgeeks.com·21h
📟Terminals
🔗 Steph Ango: File over app (https://stephango.com/file-over-app)
lmika.org·5h
💾Persistence Strategies
Semantic Dictionary Encoding
falvotech.com·13h·Discuss: Hacker News
💾Binary Formats
Today we're launching Reserved Instances
threadreaderapp.com·5h
🖥GPUs
AMD Continues Enhancing AMDGPU/AMDKFD Drivers For Checkpoint/Restore
phoronix.com·18h
Zero-Copy APIs
Rendezvous Hashing Explained (2020)
randorithms.com·8h
🌳Data Structures
Is Recursion in LLMs a Path to Efficiency and Quality?
pub.towardsai.net·4h
🧠LLM Inference
StringWa.rs on GPUs: Databases & Bioinformatics 🦠
ashvardanian.com·9h
🗂️Vector Indexes
Verlog: A Multi-turn RL framework for LLM agents
blog.ml.cmu.edu·13h
🏆LLM Benchmarking
Why you should care about the JDBC fetch size
in.relation.to·16h·Discuss: r/programming
⚙️Database Internals
Securing and Scaling AI-Powered APIs
capestart.com·15h·Discuss: Hacker News
🧠Inference Serving
How next-gen laptops use NPUs for massive power savings
nordot.app·17h
🖥️Hardware Architecture