💻 Local LLMs - matmat · Scour

PackInfer: Compute- and I/O-Efficient Attention for Batched LLM Inference

arxiv.org·1d

📊HyperLogLog

Large Language Models for Mortals book released

crimede-coder.com·1h·

Discuss: Hacker News

λLambda Formalization

Zero-Latency Local AI: Tuning Your Linux Kernel for LLM Inference 🐧🧠

dev.to·3d·

Discuss: DEV

⚡Homebrew CPUs

SimGR: Escaping the Pitfalls of Generative Decoding in LLM-based Recommendation

arxiv.org·11h

🎛️Feed Filtering

Faster AI Training Unlocked With New System For Massive Language Models

quantumzeitgeist.com·1d

🚀SIMD Text Processing

Machine learning reveals hidden landscape of robust information storage

phys.org·1h

⚛️Quantum Storage

Tutorial – What is a variational autoencoder?

jaan.io·22h·

Discuss: Hacker News

🧠Neural Codecs

LocalGPT: A local AI assistant with persistent memory in a single binary

localgpt.app·20h·

Discuss: Hacker News

⚡Homebrew CPUs

Quantization-Aware Distillation

ternarysearch.blogspot.com·2d·

Discuss: Hacker News

📊Quantization

LLMs Refuse High-Cost Attacks but Stay Vulnerable to Cheap, Real-World Harm

expectedharm.github.io·10h·

Discuss: Hacker News

🛡️WASM Sandboxing

Sneaky quokka: Testing and debugging with LLMs

honnibal.dev·7h

🧪Binary Fuzzing

Document Clustering with LLM Embeddings in Scikit-learn

machinelearningmastery.com·5h

🧮Vector Embeddings

A practical systems engineering guide: Architecting AI-ready infrastructure for the agentic era

thenewstack.io·17h

How StrongDM’s AI team build serious software without even looking at the code

jmason.ie·1d

⚔️Lean Tactics

Main Content || Math ∩ Programming

jeremykun.com·1d

🧮Kolmogorov Complexity

Understanding LLM Inference Engines: Inside Nano-vLLM (Part 2)

neutree.ai·4d·

Discuss: Hacker News

📊Quantization

Show HN: Deterministic linguistic enrichment pipeline for Node.js

npmjs.com·4h·

Discuss: Hacker News

🌀Brotli Internals

A Note on Flat Abstract Syntax Trees

gist.github.com·21h·

Discuss: Hacker News

🔗Monadic Parsing

The risks of OpenAI's Whisper audio transcription model

baldurbjarnason.com·4h·

Discuss: Hacker News

Unlocking Knowledge with AI

zappable.com·1d

🤖AI Curation
