H1B-KV: Hybrid One-Bit Caches for Memory-Efficient Large Language Model Inference
arxiv.org·3d
💨Cache Optimization
#16 Pronic, oblong, rectangular numbers.... Etymology and History of Math Terms
pballew.blogspot.com·1h
λLambda Encodings
Neural Networks from Scratch in Python: Simpler Than You Think
hamza.se·11h
📊Quantization
An enough week
blog.mitrichev.ch·1d
🧮Z3 Solver
YouTube gets ~5% CTR lift on Shorts by replacing embedding tables with Semantic IDs
shaped.ai·1d
📊Feed Optimization
Explicit Lossless Vertex Expanders!
gilkalai.wordpress.com·22h
💎Information Crystallography
[P] Lossless compression for 1D CNNs
reddit.com·21h
📊Quantization
A gentle introduction to Generative AI: Historical perspective
medium.com·7h
🧠Learned Codecs
Enhanced Predictive Maintenance of Geothermal Heat Exchangers via Hybrid Bayesian Optimization and LSTM
dev.to·3h
💻Local LLMs
Revisiting Karpathy's 'Unreasonable Effectiveness of Recurrent Neural Networks'
gilesthomas.com·7h
🎧Learned Audio
RND1: Simple, Scalable AR-to-Diffusion Conversion
radicalnumerics.ai·1d
💻Local LLMs
Contrastive Weak-to-strong Generalization
arxiv.org·1d
Information Bottleneck
Sorting encrypted data without decryption: a practical trick
dev.to·17h
🔐Hash Functions
In-Depth Analysis: "Attention Is All You Need"
dev.to·17h
🧠Intelligence Compression
Doing Math with Embeddings for Better AI Ad Targeting
ethicalads.io·2d
📊Feed Optimization
Optimal Stopping in Latent Diffusion Models
arxiv.org·1d
🧠Machine Learning
LAION, the dataset behind Stable Diffusion (2023)
deeplearning.ai·2h
🎓Academic Torrents
MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding
arxiv.org·1d
🧠Learned Codecs