Attention Optimization, Memory Efficiency, Transformer Acceleration, IO-Aware

Kimi Linear: An Expressive, Efficient Attention Architecture
arxiviq.substack.com·11h·
Discuss: Substack
🧩Attention Kernels
Flag this post
ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for LargeVision-and-Language Models
paperium.net·8h·
Discuss: DEV
🏎️TensorRT
Flag this post
Some Fun Videos on Optimizing NES Code
bumbershootsoft.wordpress.com·14h
🚀Compiler Optimization
Flag this post
This browser extension keeps every idea I highlight instantly searchable
makeuseof.com·16h
🐕Ruff
Flag this post
Your Transformer is Secretly an EOT Solver
elonlit.com·2d·
Discuss: Hacker News
👁️Attention Optimization
Flag this post
Everything About Transformers
krupadave.com·3d
🧩Attention Kernels
Flag this post
Minimax pre-training lead explains why no linear attention
reddit.com·3d·
Discuss: r/LocalLLaMA
👁️Attention Optimization
Flag this post
🧠 Soft Architecture (Part B): Emotional Timers and the Code of Care (Part 5 of the SaijinOS series)
dev.to·20h·
Discuss: DEV
🤖AI Coding Tools
Flag this post
NVIDIA and Samsung working even closer together, new semiconductor AI factory has 50,000+ GPUs
tweaktown.com·7h
🔍Nsight
Flag this post
Dual-format attentional template during preparation in human visual cortex
elifesciences.org·4d
🧩Attention Kernels
Flag this post
ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for LargeVision-and-Language Models
dev.to·8h·
Discuss: DEV
👁️Attention Optimization
Flag this post
Weak-To-Strong Generalization
lesswrong.com·7h
📉Model Quantization
Flag this post
Multitasking On The Humble Z80 CPU
hackaday.com·5h
📈Occupancy Optimization
Flag this post
HUSKYLENS 2 Expands Edge AI Vision with MCP Integration and YOLO Model Support
linuxgizmos.com·9h
🧮cuDNN
Flag this post
Microstutter in games? Your RGB software might be why
howtogeek.com·18h
📈Occupancy Optimization
Flag this post
Sparse Adaptive Attention “MoE”: How I Solved OpenAI’s $650B Problem With a £700 GPU
medium.com·4d·
🧩Attention Kernels
Flag this post
Stable Video Infinity: Infinite-Length Video Generation with Error Recycling
paperium.net·1d·
Discuss: DEV
📊Gradient Accumulation
Flag this post
Specialized structure of neural population codes in parietal cortex outputs
nature.com·1d
🧩Attention Kernels
Flag this post
Revisiting Generative Infrared and Visible Image Fusion Based on Human Cognitive Laws
arxiv.org·2d
🏎️TensorRT
Flag this post
Accelerating AI inferencing with external KV Cache on Managed Lustre
cloud.google.com·1d
ONNX Runtime
Flag this post