Kimi Linear: An Expressive, Efficient Attention Architecture
arxiviq.substack.com·1d·
Discuss: Substack
🧩Attention Kernels
Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization
paperium.net·12h·
Discuss: DEV
🧩Attention Kernels
ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for Large Vision-and-Language Models
dev.to·21h·
Discuss: DEV
🧩Attention Kernels
Can't stop till you get enough
cant.bearblog.dev·4h·
Discuss: Hacker News
📜TorchScript
[D] Best (free) courses on neural networks
reddit.com·1d·
🧩Attention Kernels
Everything About Transformers
krupadave.com·3d
🧩Attention Kernels
The middle brother in classifier development: What is RandAugment?
openaccess.thecvf.com·11h·
Discuss: DEV
📊Gradient Accumulation
MobileNetV3 Paper Walkthrough: The Tiny Giant Getting Even Smarter
towardsdatascience.com·10h
🎯Tensor Cores
Your Transformer is Secretly an EOT Solver
elonlit.com·2d·
Discuss: Hacker News
Flash Attention
ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for Large Vision-and-Language Models
paperium.net·21h·
Discuss: DEV
🏎️TensorRT
A Minimal Route to Transformer Attention
neelsomaniblog.com·3d·
Discuss: Hacker News
🧩Attention Kernels
University of Surrey researchers mimic brain wiring to improve AI – BBC
news.google.com·10h
Flash Attention
Neural bases of sustained attention during naturalistic parent-infant interactions
nature.com·2d
🧩Attention Kernels
Unlocking AI Potential: Squeezing Giant Models into Tiny Spaces
dev.to·26m·
Discuss: DEV
📉Model Quantization
Vision = Language: I Decoded VLM Tokens to See What AI 'Sees' 🔬
reddit.com·4h·
Discuss: r/LocalLLaMA
🛠Ml-eng
Learning to program "recycles" preexisting fronto-parietal population codes of logical algorithms
jneurosci.org·8h·
Discuss: Hacker News
📊Gradient Accumulation
MiniMax pre-training lead explains why no linear attention
reddit.com·3d·
Discuss: r/LocalLLaMA
Flash Attention
Unleashing Diffusion Transformers for Visual Correspondence by Modulating Massive Activations
arxiv.org·2d
🧮cuDNN
AI-Driven Marketing: How Intelligent Architecture Boosts Visibility and Impact
future.forem.com·16h·
Discuss: DEV
🤖AI Coding Tools
Product Designer's workflow for prototyping with Cursor
hvpandya.com·7h·
Discuss: Hacker News
🤖AI Coding Tools