Optimizing FP4 Mixed-Precision Inference on AMD GPUs
lmsys.org·1d
🧠Memory Hierarchy
Dynamic Resource Allocation in Asynchronous Compute Pipelines via Reinforcement Learning
dev.to·12h·
Discuss: DEV
🔮CPU Branch Prediction
The Syndrome-Space Lens: A Complete Resolution of Proximity Gaps for Reed-Solomon Codes
eprint.iacr.org·1d
🎯Ring Buffers
Storing Unwise Amounts of Data in JavaScript Bigints
jonathan-frere.com·1d·
🔢Binary Formats
What is Generative AI? A Comprehensive Beginner’s Guide
future.forem.com·5h·
Discuss: DEV
🎭Program Synthesis
LingoDB – Data Processing with Compiler Technology
lingo-db.com·1d·
Discuss: Hacker News
🗃️Query Compilation
Show HN: Optimizing DeepSeek's NSA for TPUs – A Kernel Worklog
henryhmko.github.io·1d·
Discuss: Hacker News
Tokenizer Optimization
A simulator significantly inspired by the first commercial transistor computer
git.sr.ht·9h·
🖥️Minimal VMs
Gauss–Seidel visually explained
wordsandbuttons.online·17h
🧮Linear Algebra
GTA -- An ATSP Method: Shifting the Bottleneck from Algorithm to RAM
arxiv.org·3d
⏲️Embedded GC
Python Can Now Call Mojo
towardsdatascience.com·11h
🛠programming language development
Built a database in Rust and got 1000x the performance of Neo4j
reddit.com·1d·
Discuss: r/rust
🔍Query Engines
什么是Online Softmax and Flash Attention?
dev.to·1d·
Discuss: DEV
🧮Linear Algebra
Unlock the power of SVE and SME with SIMD Loops
community.arm.com·2d
🤖Embedded Go
We, Programmers A Chronicle of Coders from Ada to AI
gwolf.org·5h
🏺Code Archeology
How to waste CPU like a Professional
mostlynerdless.de·2d·
Interpreter Optimization
Implementing a generic Schwartzian transform in Rust for fun
medium.com·1d·
Discuss: r/rust
🦀Rust Macros
I Spent Three Nights Solving Listen Labs Berghain Challenge (and Got #16)
kuber.studio·11h·
Discuss: Hacker News
🪢Rope Data Structures
MicroAlloc
bogdanthegeek.github.io·7h
🧠Memory Allocators
Welcome to the World of Embedded Systems with Python
avid-coders.com·2d·
Discuss: DEV
🤖Embedded Go