๐Ÿฟ๏ธ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
๐Ÿ’ป Local LLMs

Model Quantization, Inference Optimization, GGUF Format, Privacy-preserving AI

What is a large language model?
proton.meยท23h
๐ŸŽตAudio ML
Lethe: Purifying Backdoored Large Language Models with Knowledge Dilution
arxiv.orgยท1d
๐Ÿ”BitFunnel
Add a privacy layer to your LLM app as AI companies' privacy policies evolve
medium.comยท21hยท
Discuss: Hacker News
๐Ÿ”BitFunnel
Hyper-Efficient Quantized Neural Network Pruning via Adaptive Bit-Width Allocation
dev.toยท20hยท
Discuss: DEV
๐Ÿ“ŠQuantization
5 Key Ways LLMs Can Supercharge Your Machine Learning Workflow
machinelearningmastery.comยท1d
๐ŸŽตAudio ML
Whirlaway: Multilinear STARKs using WHIR as polynomial commitment scheme
blog.lambdaclass.comยท1d
๐ŸŽฏPerformance Proofs
What AI chatbots are doing under the hood
gilesthomas.comยท20hยท
Discuss: Hacker News
๐ŸŽ™๏ธWhisper
Codeminer42 Dev Weekly #76
blog.codeminer42.comยท1d
๐Ÿ”„Language Evolution
Unlocking Multimodal Video Transcription with Gemini
towardsdatascience.comยท1d
โœ…Verification Codecs
AI Models Need a Virtual Machine
blog.sigplan.orgยท20hยท
Discuss: Hacker News, Hacker News
โš™๏ธTLA+
LLMs for the Old and Infirm
oblomovka.comยท7hยท
Discuss: Hacker News, www.oblomovka.com
๐ŸšNordic Shell
AI researcher Andrej Karpathy says he's "bearish on reinforcement learning" for LLM training
the-decoder.comยท2h
๐Ÿง Intelligence Compression
LLM Evaluation: Practical Tips at Booking.com
booking.aiยท17hยท
Discuss: Hacker News
๐Ÿ”—Constraint Handling
Arbitraging Down LLM Inference to the Cost of Electricity
inference.netยท1dยท
Discuss: Hacker News
๐Ÿ”’Linear Types
ReST-RL: Achieving Accurate Code Reasoning of LLMs with Optimized Self-Training and Decoding
arxiv.orgยท2d
๐Ÿ“Linear Logic
AMD MI300X for LLM Serving Disaggregating Prefill and Decode with SGLang
rocm.blogs.amd.comยท1dยท
Discuss: Hacker News
๐ŸŒŠStreaming Compression
Measuring Reasoning Utility in LLMs via Conditional Entropy Reduction
arxiv.orgยท1d
๐Ÿง Intelligence Compression
Automated API Ecosystem Resilience Scoring via Hybrid Graph Neural Networks
dev.toยท19hยท
Discuss: DEV
๐Ÿ”—Topological Sorting
Evaluating Language Model Reasoning about Confidential Information
arxiv.orgยท2d
๐ŸงชBinary Fuzzing
Forcing ChatGPT to Obey: Minimal and Deterministic Rules
dev.toยท1hยท
Discuss: DEV
๐Ÿ“ABNF Parsing
Loading...Loading more...
AboutBlogChangelogRoadmap