🧩 Memory Interleaving - hello · Scour

TLX: Hardware-Native, Evolvable MIMW GPU Compiler for Large-scale Production Environments ⚡Hardware Acceleration

The M:N Concurrent Model — A Complete Guide. From First Principles to Production Schedulers 🧵Lightweight Threads

0xkiire.com·3d·Hacker News, r/golang, r/rust

Why gRPC Is Fast: The Real Reason Is HTTP/2, Not Just Protobuf 🔌gRPC

javarevisited.substack.com·2d·r/programming

Knowledge gaps for neuromorphic ionic computing 🧮Intel MKL-DNN

science.org·5d

Regulating Branch Parallelism in LLM Serving 🧵OpenMP

FractalSortCPU: Bandwidth-Efficient Compressed Radix Sort on CPU 📋Columnar Storage

Data Path Fusion in GPU for Analytical Query Processing 📊Vectorized Query Execution

HexiSeq: Accommodating Long Context Training of LLMs over Heterogeneous Hardware 🔄Hardware Transactional Memory

Surviving Partial Rank Failures in Wide Expert-Parallel MoE Inference 🧩mimalloc

Stencil Computations on Cerebras Wafer-Scale Engine 🌀Naiad

Towards Compute-Aware In-Switch Computing for LLMs Tensor-Parallelism on Multi-GPU Systems 🚀Intel ISPC

arxiv.org·4d·Hacker News

A Controlled Study of Memory Hierarchy Transitions in Quantum Circuit Simulation on Apple M4 Pro Unified Memory Architecture ⚛️Quantum Computing

On Similarity of Computational Kernels in our Codes and Proxies 🧮Vector Databases

Unleashing Scalable Context Parallelism for Foundation Models Pre-Training via FCP 🤖TVM

Piper: Efficient Large-Scale MoE Training via Resource Modeling and Pipelined Hybrid Parallelism 🧩mimalloc

An Efficient Hybrid Sparse Attention with CPU-GPU Parallelism for Long-Context Inference 🔬Deep Learning

EnergyLens: Interpretable Closed-Form Energy Models for Multimodal LLM Inference Serving 🤖TVM

TAD: Temporal-Aware Trajectory Self-Distillation for Fast and Accurate Diffusion LLM 🤖TVM

CCL-Bench 1.0: A Trace-Based Benchmark for LLM Infrastructure 🚀Performance

arxiv.org·4d·Hacker News

Enhancing Performance Insight at Scale: A Heterogeneous Framework for Exascale Diagnostics 📊Extrae

Log in to enable infinite scrolling