H1B-KV: Hybrid One-Bit Caches for Memory-Efficient Large Language Model Inference
arxiv.org·2d
💨Cache Optimization
Neural Networks from Scratch in Python: Simpler Than You Think
hamza.se·6h·
Discuss: Hacker News
📊Quantization
An enough week
blog.mitrichev.ch·1d·
🧮Z3 Solver
YouTube gets ~5% CTR lift on Shorts by replacing embedding tables with Semantic IDs
shaped.ai·1d
📊Feed Optimization
Explicit Lossless Vertex Expanders!
gilkalai.wordpress.com·17h
💎Information Crystallography
[P] Lossless compression for 1D CNNs
reddit.com·16h·
📊Quantization
A gentle introduction to Generative AI: Historical perspective
medium.com·1h·
Discuss: Hacker News
🧠Learned Codecs
Automated Spectral Fingerprint Deconvolution for Polymer Identification via Deep Oligomer Networks
dev.to·1h·
Discuss: DEV
🌈Spectroscopy
Revisiting Karpathy's 'Unreasonable Effectiveness of Recurrent Neural Networks'
gilesthomas.com·1h·
Discuss: Hacker News
🎧Learned Audio
RND1: Simple, Scalable AR-to-Diffusion Conversion
radicalnumerics.ai·1d·
Discuss: Hacker News
💻Local LLMs
In-Depth Analysis: "Attention Is All You Need"
dev.to·11h·
Discuss: DEV
🧠Intelligence Compression
Contrastive Weak-to-strong Generalization
arxiv.org·22h
Information Bottleneck
Integral Signatures of Activation Functions: A 9-Dimensional Taxonomy and Stability Theory for Deep Learning
arxiv.org·22h
🧠Machine Learning
Why Your Simple Password Is a Mathematical Catastrophe
tawandamunongo.dev·1d·
Discuss: Hacker News
🔐Hash Functions
Sorting encrypted data without decryption: a practical trick
dev.to·11h·
Discuss: DEV
🔐Hash Functions
Doing Math with Embeddings for Better AI Ad Targeting
ethicalads.io·2d·
Discuss: Hacker News
📊Feed Optimization
Optimal Stopping in Latent Diffusion Models
arxiv.org·22h
🧠Machine Learning
MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding
arxiv.org·22h
🧠Learned Codecs
Activation Alchemist: Sculpting Stability with Functional Signatures
dev.to·6h·
Discuss: DEV
🔍Concolic Testing