🔲 Loop Tiling - miterion · Scour

The Avatar Cache: Enabling On-Demand Security with Morphable Cache Architecture

arxiv.org·1d

⚡CUDA Programming Patterns

Same Engine, Multiple Gears: Parallelizing Fixpoint Iteration at Different Granularities (Extended Version)

arxiv.org·1d

🌊CUDA Streams

Local-First AI: How SLMs are Fixing the Latency Gap 💻✨

dev.to·1d·

Discuss: DEV

⚡Flash Attention

Your VCL App: 4x to 11x Faster Math Performance with Elements

blogs.remobjects.com·1d·

Discuss: Hacker News

Comparing accumulate to C++23s fold_left

meetingcpp.com·2d

🚀Compiler Optimization

What should I program?

jamesmcm.github.io·2d

Your Ray Data Pipeline Works at 10K Samples. Here's Why It Crashes at 1M.

dev.to·1d·

Discuss: DEV

🌐Distributed Computing

Faster than Dijkstra?

systemsapproach.org·1d·

Discuss: Hacker News

📊CUDA Graphs

Lucene HNSW performance: A deep dive into the OS page cache

opensearch.org·1d

📊Profiling Tools

How the GNU C Compiler became the Clippy of cryptography

theregister.com·1d·

Discuss: Hacker News, r/programming

🚀Compiler Optimization

Why JavaScript Needs Structured Concurrency | Blog

frontside.com·1d·

Discuss: Hacker News, r/javascript

🚀Compiler Optimization

January 2026 Monthly report | Alternative Rust Compiler for GCC

rust-gcc.github.io·8h·

Discuss: r/rust

Understanding the Go Runtime: The Bootstrap

internals-for-interns.com·1d·

Discuss: Hacker News, r/golang

📊Profiling Tools

the mathematics of compression in database systems

bitsxpages.com·1d·

Discuss: Hacker News

📉Model Quantization

Sculptor: The missing UI for coding agents

imbue.com·18h

🤖AI Coding Tools

John Carmack muses using a long fiber line as as an L2 cache for streaming AI data — programmer imagines fiber as alternative to DRAM

tomshardware.com

·1d·

Discuss: Hacker News

⚡Flash Attention

Geospatial System Design Patterns

systemdr.substack.com·2d·

Discuss: Substack

⚡CUDA Programming Patterns

Concurrent vs. Parallel Execution in LLM API Calls: From an AI Engineer’s Perspective

pub.towardsai.net·1d

🤖AI Coding Tools

Intel Core Ultra "Arrow Lake Refresh" Chips Focus on E-core Count and L3 Cache Uplifts

techpowerup.com·1d

🧠CPU Architecture

artima.com·2d

📊Profiling Tools

Loading more...