👁️ Attention Optimization - miterion · Scour

Learning to Remember, Learn, and Forget in Attention-Based Models

arxiv.org·2d

🧩Attention Kernels

LOTFormer: Doubly-Stochastic Linear Attention via Low-Rank Optimal Transport

arxiv.org·3d

⚡Flash Attention

Show HN: I taught AI to remember. Then it warned me

github.com·8h·Discuss: Hacker News

🤖AI Coding Tools

Spot The Difference

seekingalpha.com·1h

The import of cross-task productivity

marginalrevolution.com·1h

⚡Flash Attention

Focus and clarity

shhra.bearblog.dev·2d

⚡Flash Attention

Training-Free Real-Time Control for Autoregressive Video Generation

daydream.live·18h·Discuss: Hacker News

🏎️TensorRT

Carnegie Mellon at NeurIPS 2025

blog.ml.cmu.edu·1d

📊Gradient Accumulation

Image Classification with CNNs – Part 3: Understanding Max Pooling and Results

dev.to·12h·Discuss: DEV

Scaling LLM Post-Training at Netflix

netflixtechblog.com·57m

🏎️TensorRT

The “Think in Pictures” Upgrade for Multimodal Models

hackernoon.com·1d

🧩Attention Kernels

Google Deepmind upgrades Gemini 3 Deep Think for complex science and engineering tasks

the-decoder.com·15h

🎯Tensor Cores

BetaZero V2: A Diffusion Model for Setting Boulder Problems

evmojo37.substack.com·9h·Discuss: Substack

📊Gradient Accumulation

Quality and understandability after AI

federicopereiro.com·22h·Discuss: Hacker News

🤖AI Coding Tools

Gibbs Measures from Deep Shaped Multilayer Perceptrons

link.aps.org·20h

📊Gradient Accumulation

My Go-To AI Tools: February 2026 Update

whytryai.com·22h

🤖AI Coding Tools

Digitizing the "Shokunin": How we encoded a Master's hammer strike into AI

yusukekaizen.substack.com·1d·Discuss: Substack

📉Model Quantization

Show HN: The Algorithm's Favorite Child

chatbotkit.com·17h·Discuss: Hacker News

⚡ONNX Runtime

Arming the rebels with GPUs: Gradium, Kyutai, and Audio AI

amplifypartners.com·3h·Discuss: Hacker News

🏎️TensorRT

Architectural and Mathematical Foundations of Machine Learning: A Rigorous Synthesis of Theory, Geometry, and Implementation

chizkidd.github.io·1d·Discuss: Hacker News

📉Model Quantization
