🧠 CUDA Memory Management - miterion · Scour

Beyond a Single Queue: Multi-Level-Multi-Queue as an Effective Design for SSSP problems on GPUs

arxiv.org·1d

🌊CUDA Streams

Hitting 1,000 tokens per second on a single RTX 5090

blog.alpindale.net·3d·

Discuss: Hacker News, Hacker News

🎛️CUDA Optimization

borodark/exmc: Probabilistic programming in BEAM

github.com·18h

⚡ONNX Runtime

BOute: Cost-Efficient LLM Serving with Heterogeneous LLMs and GPUs via Multi-Objective Bayesian Optimization

arxiv.org·9h

Minimum Energy Per Query

semiengineering.com·6h

📈Occupancy Optimization

OLIX: Compute Manifesto

olix.com·1d·

Discuss: Hacker News

⚡CUDA Programming Patterns

Parallel Track Transformers: Enabling Fast GPU Inference with Reduced Synchronization

machinelearning.apple.com·2d

⏱️CUDA Events

building cuda-gdb from sources

redplait.blogspot.com·4d·

Discuss: redplait.blogspot.com

⚡CUDA Programming Patterns

An async HTTP server in ~80 lines of modern C++ (coroutines)

vixcpp.com·6h·

Discuss: Hacker News

⚙️JIT Compilation

Rust Memory Management: The Playroom Analogy

adacore.com·2d·

Discuss: Hacker News

Bitsum. Real-time CPU Optimization and Automation

bitsum.com·16h

📊Profiling Tools

Can you disable multithreaded calculations for avoidance logic?

forrestthewoods.com·3h·

Discuss: r/godot

⚡CUDA Programming Patterns

CXMT shifts 20 percent of DRAM capacity to HBM3, China’s AI strategy gets a memory upgrade

igorslab.de·9h

⚡Flash Attention

remote locks and distributed locks

tautik.me·23h

🌐Distributed Computing

Edge AI in a DRAM shortage: Doing more with less

edn.com·4h

⚡Flash Attention

Cache-aware disaggregated inference for up to 40% faster long-context LLM serving

together.ai·1d·

Discuss: Hacker News, r/LocalLLaMA

📈Occupancy Optimization

How I Built MemCP: Giving Claude a Real Memory

dev.to·1d·

Discuss: DEV

📊Profiling Tools

How to connect Convex to RunPod for serverless GPU workloads

stack.convex.dev·2d

How a ‘zombie’ chipmaker became Nvidia’s vital AI ally

ft.com

·1d

🎯GPU Kernels

Game Boy Advance Dev: Drawing Pixels

mattgreer.dev·1d·

Discuss: r/programming

Loading more...