🐿️ Scour
🎯 Vector Quantization

Product Quantization, Embedding Compression, Memory Efficiency, Approximate Search

Scaling Laws for LLM Based Data Compression
lesswrong.com·5h
📝Text Compression
Trainable Dynamic Mask Sparse Attention
arxiv.org·8h
🧠LLM Inference
Real-time neural video codec – 100 FPS 1080p and 4K videos
github.com·6h
🔬RaBitQ
How LLMs See the World
blog.bytebytego.com·21h
🧠LLM Inference
LeetCode #70: Climbing Stairs
anmoltomer.bearblog.dev·8h
🧮SMT Solvers
SAT Requires Exhaustive Search
link.springer.com·16h
🧮SMT Solvers
Beyond Manually Designed Pruning Policies with Second-Level Performance Prediction: A Pruning Framework for LLMs
arxiv.org·8h
🧠LLM Inference
Lessons from Amazon S3 Vector Store and the Nuances of Hybrid Vector Storage
caylent.com·21h
🏗️Search Infrastructure
Context Guided Transformer Entropy Modeling for Video Compression
arxiv.org·8h
📊Embeddings
Building, Fast and Slow
idiallo.com·4h
👨‍💻Software development practices
A Histogram Is a Generative Model
jonathandinu.com·21h
🎭Claude
E-VRAG: Enhancing Long Video Understanding with Resource-Efficient Retrieval Augmented Generation
arxiv.org·8h
📊Embeddings
Information Rates of Approximate Message Passing for Bandlimited Direct-Detection Channels
arxiv.org·8h
ℹ️Information Theory
Filtering with Self-Attention and Storing with MLP: One-Layer Transformers Can Provably Acquire and Extract Knowledge
arxiv.org·8h
🧠LLM Inference
Kernel-Based Sparse Additive Nonlinear Model Structure Detection through a Linearization Approach
arxiv.org·8h
🧠LLM Inference
Attention was never enough: Tracing the rise of hybrid LLMs
ai21.com·10m
🧠LLM Inference
Simple Methods Defend RAG Systems Well Against Real-World Attacks
arxiv.org·8h
💾Persistence Strategies
Accelerating multiparametric quantitative MRI using self-supervised scan-specific implicit neural representation with model reinforcement
arxiv.org·8h
📊Embeddings
Hessian analysis with JAX: a platform-agnostic, high-performance approach
lesswrong.com·7h
🕯️Candle
Dataset Condensation with Color Compensation
arxiv.org·8h
📊Embeddings