Vectorization, AVX, SSE, Parallel Processing

Gröbner Bases Explained: From Abstract Algebra to Real-World Optimization
news.ycombinator.com·1w·
Discuss: Hacker News
🔢Number Theory
Python Concurrency and Parallelism: 8 Essential Techniques for High-Performance Applications
dev.to·1w·
Discuss: DEV
⚙️Compilers
Parallelisation of partial differential equations via representation theory
arxiv.org·3w
📐Linear Algebra
Cycle-accurate 6502 emulator as coroutine in Rust
github.com·5d
⚙️Compilers
AI accelerator selection for inference: A stage-based framework
developers.redhat.com·1w
⚙️Compilers
HALAC 0.4.3
reddit.com·1w·
Discuss: r/compsci
📡DSP
Microsoft Readies Windows-on-Arm for Gaming With AVX/AVX2 Support
techpowerup.com·2w
🌐WebAssembly
DeepPrune: Parallel Scaling without Inter-trace Redundancy
dev.to·2w·
Discuss: DEV
⚙️Compilers
Optimizing gpt-oss-120B on AMD RX 6900 XT 16GB: Achieving 19 tokens/sec
reddit.com·1w·
Discuss: r/LocalLLaMA
🧠CPU Architecture
Pick the Right Container
boost.org·3w·
Discuss: r/cpp
⚙️Compilers
Rasterizer Project - Part 3: Geometry
dev.to·4d·
Discuss: DEV
📐Linear Algebra
Abstract or die: Why AI enterprises can't afford rigid vector stacks
venturebeat.com·2w
🌐WebAssembly
Why is AI Generated Rust slow when compared with Go/C#/Node/JavaScript
srid68.github.io·2d·
Discuss: Hacker News
🌐WebAssembly
FlashInfer Bench: A Benchmark Suite for AI Systems That Improve Themselves
flashinfer.ai·2w·
Discuss: Hacker News
⚙️Compilers
Modern perfect hashing
blog.sesse.net·1w
🔐Cryptography
Half-Quadratic Quantization of large machine learning models
dropbox.tech·2w
⚙️Compilers
Evaluating the Infinity Cache in AMD Strix Halo
chipsandcheese.com·2w
🧠CPU Architecture
Hydra: Dual Exponentiated Memory for Multivariate Time Series Analysis
arxiv.org·2d
📐Linear Algebra
(PR) Compute-In-Memory APU Achieves GPU-Class AI Performance at a Fraction of the Energy Cost
techpowerup.com·2w
🧠CPU Architecture
A Sparse Polynomial Multiplier for HQC Integrating Parallelism and Power-Based Side-Channel Countermeasures
eprint.iacr.org·1w
📐Linear Algebra