Attention Optimization, Memory Efficiency, Transformer Acceleration, IO-Aware

Kimi Linear: An Expressive, Efficient Attention Architecture
arxiviq.substack.com·1d·
Discuss: Substack
🧩Attention Kernels
Moving past speculation: How deterministic CPUs deliver predictable AI performance
venturebeat.com·18h
🧠CPU Architecture
Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization
paperium.net·12h·
Discuss: DEV
🧩Attention Kernels
Can-t stop till you get enough
cant.bearblog.dev·4h·
Discuss: Hacker News
📜TorchScript
Scalable In-Memory Associative Processing for Graph Neural Network Inference
dev.to·11h·
Discuss: DEV
🎯GPU Kernels
Generation at the Speed of Thought: Speculative Decoding
bittere.substack.com·12h·
Discuss: Substack
🎯Tensor Cores
Learning to program "recycles" preexisting F-P pop codes of logical algorithms
jneurosci.org·8h·
Discuss: Hacker News
📊Gradient Accumulation
MobileNetV3 Paper Walkthrough: The Tiny Giant Getting Even Smarter
towardsdatascience.com·10h
🎯Tensor Cores
ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for Large Vision-and-Language Models
dev.to·21h·
Discuss: DEV
👁️Attention Optimization
Uber launches platform-specific attention metric with Adelaide and Kantar
ppc.land·5h
👁️Attention Optimization
Platform generated AI slop at scale
markjgsmith.com·1h
🤖AI Coding Tools
Vision = Language: I Decoded VLM Tokens to See What AI 'Sees' 🔬
reddit.com·4h·
Discuss: r/LocalLLaMA
🛠Ml-eng
These 3 lesser-known free Android apps make my life easier
makeuseof.com·1d
🤖AI Coding Tools
CEO Interview with Wilfred Gomes of Mueon Corporation
semiwiki.com·7h
⚙️Systems Programming
Some Fun Videos on Optimizing NES Code
bumbershootsoft.wordpress.com·1d
🚀Compiler Optimization
Intel's killed-off BMG-X3/X4 GPUs: 3D stacked die, up to 40 GPU cores, 512MB Adamantine cache
tweaktown.com·2h
🔧PTX
ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for Large Vision-and-Language Models
paperium.net·21h·
Discuss: DEV
🏎️TensorRT
Your Transformer is Secretly an EOT Solver
elonlit.com·2d·
Discuss: Hacker News
👁️Attention Optimization
Product Designer's workflow for prototyping with Cursor
hvpandya.com·7h·
Discuss: Hacker News
🤖AI Coding Tools