Attention Optimization, Memory Efficiency, Transformer Acceleration, IO-Aware

TinyML is the most impressive piece of software you can run on any ESP32
xda-developers.com·2d
ONNX Runtime
EP187: Why is DeepSeek-OCR such a BIG DEAL?
blog.bytebytego.com·18h
🤖AI Coding Tools
😺 🎙️ Adobe’s CTO: How AI will end creative “grunt work”
theneurondaily.com·1d
🤖AI Coding Tools
Brain-mimicking artificial neuron could solve AI’s growing energy problem
psypost.org·10h
📊Gradient Accumulation
Show HN: Hot or Slop – Visual Turing test on how well humans detect AI images
hotorslop.com·2d·
Discuss: Hacker News
👁️Attention Optimization
Beyond Optimization: The Physics and Logic Driving AI's Three Stages of Societal Transformation
youtu.be·11h·
Discuss: DEV
🤖AI Coding Tools
An underqualified reading list about the transformer architecture
fvictorio.github.io·2d·
Discuss: Hacker News
🧩Attention Kernels
RF-DETR Under the Hood: The Insights of a Real-Time Transformer Detection
towardsdatascience.com·1d
👁️Attention Optimization
A hitchhiker's guide to CUDA programming
seanzhang.me·2d·
Discuss: Hacker News
🎯GPU Kernels
A Minimal Route to Transformer Attention
neelsomaniblog.com·3d·
Discuss: Hacker News
🧩Attention Kernels
Opportunistically Parallel Lambda Calculus
dl.acm.org·2d·
Discuss: Hacker News
💡LSP
LTX2 Video – Open-Access AI Video Generator with Synchronized Audio
ltx2.video·3h·
Discuss: Hacker News
🤖AI Coding Tools
Semantic search with embeddings in PHP: a hands-on guide using Neuron AI and Ollama
ollama.com·12h·
Discuss: DEV
🛠Ml-eng
Feature Infrastructure Engineering: A Comprehensive Guide
mlfrontiers.substack.com·17h·
Discuss: Substack
ONNX Runtime
Jackknife Transmittance and MIS Weight Estimation
momentsingraphics.de·12h·
Discuss: Hacker News
🏎️TensorRT
L16 Benchmark: How Prompt Framing Affects Truth, Drift, and Sycophancy in GEMMA-2B-IT vs PHI-2
colab.research.google.com·20h·
Discuss: r/LocalLLaMA
⏱️Benchmarking
AI Inference: The Silent Budget Killer (and How to Stop It)
dev.to·8h·
Discuss: DEV
ONNX Runtime
Cycle-accurate 6502 emulator as coroutine in Rust
github.com·19h·
📊Profiling Tools
Reimagining Video Creation with AI and Cinematography Automation
reddit.com·19h·
Discuss: r/midjourney
👁️Attention Optimization
Objects as Random Access Memory
tbr.bearblog.dev·7h
✂️CUTLASS