Vectorization, AVX, SSE, Parallel Processing

Intel XeSS 3 brings 4X multi-frame generation to laptops
xda-developers.com·3w
🧠CPU Architecture
Flag this post
Compiler optimizations for 5.8ms GPT-OSS-120B inference (not on GPUs)
furiosa.ai·2w·
Discuss: Hacker News
⚙️Compilers
Flag this post
The next RISC-V processor frontier: AI
edn.com·6d·
Discuss: Hacker News
🧠CPU Architecture
Flag this post
Some Fun Videos on Optimizing NES Code
bumbershootsoft.wordpress.com·4d
⚙️Compilers
Flag this post
SafeRace: WebGPU Memory Safety in the Presence of Data Races
dl.acm.org·3w·
Discuss: Hacker News
🌐WebAssembly
Flag this post
GPU Acceleration with Polars LazyFrames
jtrive.com·1w
🎨Computer Graphics
Flag this post
Emulating human-like adaptive vision for efficient and flexible machine visual perception
nature.com·12h
🎨Computer Graphics
Flag this post
Run LLMs Locally
ikangai.com·18h·
Discuss: Hacker News
⚙️Compilers
Flag this post
Why Multimodal AI Broke the Data Pipeline — And How Daft Is Beating Ray and Spark to Fix It
hackernoon.com·3d
📡DSP
Flag this post
Does subgroup/wave size matter?
gfxstrand.net·2w·
Discuss: Hacker News
🎨Computer Graphics
Flag this post
CHIP8 – writing emulator, assembler, example game and VHDL hardware impl
blog.dominikrudnik.pl·2d·
Discuss: Hacker News
⚙️Compilers
Flag this post
Optimizing the Plush Interpreter for Faster Raytracing
pointersgonewild.com·3w
🌐WebAssembly
Flag this post
Expert Parallelism: Scaling Mixture-of-Experts Models
digitalocean.com·1w
⚙️Compilers
Flag this post
Developing RISC-V Compute Subsystems
semiengineering.com·1w
🧠CPU Architecture
Flag this post
Battlefield 6 Benchmark: 33 CPUs Tested in Multiplayer
techspot.com·3w
🧠CPU Architecture
Flag this post
Sparse Adaptive Attention “MoE”: How I Solved OpenAI’s $650B Problem With a £700 GPU
medium.com·1w·
🧠CPU Architecture
Flag this post
Fast PEFT Serving at Scale
databricks.com·2w
⚙️Compilers
Flag this post
How to Choose the Right GPU for Your Machine Learning Projects
acecloud.ai·1w·
Discuss: DEV
🎨Computer Graphics
Flag this post
Multi-Core By Default
rfleury.com·3w·
🌐WebAssembly
Flag this post