Speculative Decoding: Making LLMs Faster Without Sacrificing Quality
dev.to·3h·
Discuss: DEV
🚀SIMD Text Processing
Flag this post
Smoothsort Demystified
keithschwarz.com·16h·
Discuss: Hacker News
🌳Trie Structures
Flag this post
Scheduling in LLM Inference
fergusfinn.com·1d·
Discuss: Hacker News
💻Local LLMs
Flag this post
PANDA - Patch And Distribution-Aware Augmentation for Long-Tailed Exemplar-Free Continual Learning
arxiv.org·1d
🧠Machine Learning
Flag this post
GitHub - tdewolff/canvas: Vector graphics in Go
github.com·2d
📟Terminal Typography
Flag this post
5 Essential Python Scripts for Intermediate Machine Learning Practitioners
machinelearningmastery.com·2d
🏠Homelab Pentesting
Flag this post
Can Language Models Optimize Real-World Repositories on Real Workloads?
swefficiency.com·11m·
Discuss: Hacker News
Performance Mythology
Flag this post
How is it that this problem, with its 21 data points, is so much easier to handle with 1 predictor than with 16 predictors?
statmodeling.stat.columbia.edu·20h
🧮Kolmogorov Bounds
Flag this post
EyesOff: I Built a Screen Contact Detection Model
ym2132.github.io·2h·
Discuss: Hacker News
📊Learned Metrics
Flag this post
Intro to Routing: Mixture-of-Experts and Expert Choice
neelsomaniblog.com·13h·
Discuss: Hacker News
🧮Kolmogorov Bounds
Flag this post
MySQL COUNT Scalar Subquery Optimization: The Complete Guide
dev.to·1d·
Discuss: DEV
🚀Query Optimization
Flag this post
Spec-Driven Development: The Waterfall Strikes Back
marmelab.com·3h·
Discuss: Hacker News
Format Verification
Flag this post
Human or Machine? Low-Latency Audio Detection of Humans at Scale
nooks.ai·1d·
Discuss: Hacker News
🌊Stream Processing
Flag this post
Data Science Weekly – Issue 625
datascienceweekly.substack.com·1d·
Discuss: Substack
📰RSS Archaeology
Flag this post
Asynchronous Wait-Free Runtime Verification and Enforcement of Linearizability
arxiv.org·1d
🎯Performance Proofs
Flag this post
Efficient Hyperdimensional Computing with Modular Composite Representations
arxiv.org·1d
💎Information Crystallography
Flag this post
Understanding neural networks through sparse circuits
openai.com·2d·
Discuss: Hacker News
🧠Machine Learning
Flag this post
How the PolyBlocks AI Compiler Works
docs.polymagelabs.com·2d·
Discuss: Hacker News
🧮Compute Optimization
Flag this post
IBM Patented Euler's 200 year old Math Technique
leetarxiv.substack.com·1d·
👑Coq Tactics
Flag this post
TARG: Training-Free Adaptive Retrieval Gating for Efficient RAG
arxiv.org·1d
🔍Information Retrieval
Flag this post