Generation at the Speed of Thought: Speculative Decoding
bittere.substack.comยท12hยท
Discuss: Substack
โšกFlash Attention
Flag this post
Beyond Start and End: PostgreSQL Range Types
boringsql.comยท1h
๐Ÿ”Type Checkers
Flag this post
Qwen3 VL 30b a3b is pure love
reddit.comยท3hยท
Discuss: r/LocalLLaMA
๐Ÿ“‰Model Quantization
Flag this post
How We Found 7 TiB of Memory Just Sitting Around
render.comยท3dยท
๐Ÿ“ˆGPU Occupancy
Flag this post
This popular free Windows 11/10 app update install manager just got faster
neowin.netยท1d
๐Ÿ”Nsight
Flag this post
Don't give Postgres too much memory
vondra.meยท2dยท
Discuss: Hacker News
๐Ÿ”ฒLoop Tiling
Flag this post
Power instead of chips: Why Microsoftโ€™s boss Nadella is putting the brakes on AI expansion
igorslab.deยท18h
โฑ๏ธCUDA Events
Flag this post
Homelab planning
i.redd.itยท1dยท
Discuss: r/homelab
๐Ÿ”Nsight
Flag this post
Engineering Driver Reignites Battlemage B770 GPU Speculation
techpowerup.comยท2d
๐Ÿ”งPTX
Flag this post
ZkML Breakthrough: 13B Models Verified in 15 Minutes
lightcapai.medium.comยท7hยท
Discuss: Hacker News
๐ŸŽฏTensor Cores
Flag this post
Dynamic Resource Allocation in CXL-Enabled Heterogeneous Compute Clusters
dev.toยท19hยท
Discuss: DEV
๐Ÿ”Nsight
Flag this post
Myths Programmers Believe about CPU Caches
software.rajivprab.comยท2dยท
Discuss: Hacker News
๐Ÿง CPU Architecture
Flag this post
Objects as Random Access Memory
tbr.bearblog.devยท21h
โœ‚๏ธCUTLASS
Flag this post
Your GPU isn't hitting 100% utilization, and that's completely fine
xda-developers.comยท1d
โฑ๏ธCUDA Events
Flag this post
Ambient CI, progress this year
blog.liw.fiยท16h
๐Ÿ—๏ธBuild Systems
Flag this post
Supercharge Your Web Apps: A Beginner's Guide to WebAssembly Optimization
dev.toยท1dยท
Discuss: DEV
๐Ÿš€Compiler Optimization
Flag this post
Is Arrow Lake worth it for Proxmox, Game streaming, and NextCloud?
forums.anandtech.comยท9h
๐Ÿ”งPTX
Flag this post