📉 Model Quantization - miterion · Scour

Train Less, Infer Faster: Efficient Model Finetuning and Compression via Structured Sparsity

arxiv.org·2d

🎓Model Distillation

Astro: Activation-guided Structured Regularization for Outlier-Robust LLM Post-Training Quantization

arxiv.org·3d

📊Gradient Accumulation

Digitizing the "Shokunin": How we encoded a Master's hammer strike into AI

yusukekaizen.substack.com·1d·

Discuss: Substack

🤖AI Coding Tools

Finding cancer cells in a cocktail of complex tissues

sciworthy.com·19h

🧩Attention Kernels

Grassmannian Manifold Learning: Optimization and Deep Learning Architectures

hackernoon.com·1d

🏎️TensorRT

New Ovis2.6-30B-A3B, a lil better than Qwen3-VL-30B-A3B

huggingface.co·19h·

Discuss: r/LocalLLaMA

Storing Image Data As Analog Audio

hackaday.com·10h

Deterministic Inference with EigenAI

deterministicinference.com·1d

🏎️TensorRT

Karpathy's Micro LLM in JavaScript

github.com·15h·

Discuss: Hacker News

🤖AI Coding Tools

A C implementation of the inference pipeline for the Mistral AI’s Voxtral Realtime 4B model

blog.adafruit.com·14h

🏎️TensorRT

datascienceweekly.substack.com·11h·

Discuss: Substack

⏱️Benchmarking

The 5 Model Compression Techniques: How to Shrink AI 10× Without Losing Accuracy

pub.towardsai.net·6d

🎓Model Distillation

Don't give away to the gradient descent

carteakey.dev·1d·

Discuss: Hacker News

📊Gradient Accumulation

Ai’s Inner Workings Revealed By Model Trained On One Billion Data Points

quantumzeitgeist.com·14h

📊Gradient Accumulation

How Andrej Karpathy Built a Working Transformer in 243 Lines of Code

analyticsvidhya.com·18h

📜TorchScript

Recursive Language Models: Stop Stuffing the Context Window

nlp.elvissaravia.com·11h

⚡ONNX Runtime

karpathy.github.io·1d

📜TorchScript

Show HN: Latent-k – Persistent dependency map to reduce AI coding token usage

latentk.org·1d·

Discuss: Hacker News

🤖AI Coding Tools

Large Language Models for Mortals book

andrewpwheeler.com·1d

🎓Model Distillation

(Re)Discovering Natural Laws

lesswrong.com·9h

Loading more...