Vector Instructions, AVX, SSE, Parallel Processing, Performance

Why is AI Generated Rust slow when compared with Go/C#/Node/JavaScript
srid68.github.io·4h·
Discuss: Hacker News
🦀Rust
Moving past speculation: How deterministic CPUs deliver predictable AI performance
venturebeat.com·2d
🏗️CPU Architecture
Vectorizing for Fun and Performance
ibm.com·6d·
Discuss: Hacker News
🔀Parallel Algorithms
Design of quasi phase matching crystal based on differential gray wolf algorithm
arxiv.org·14h
Shader Programming
My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·1d·
Discuss: Hacker News
🎮GPU Programming
CHIP8 – writing emulator, assembler, example game and VHDL hardware impl
blog.dominikrudnik.pl·23h·
Discuss: Hacker News
🔩Assembly
Algorithmic Complexity Reduction via Quantized State Space Search
dev.to·2h·
Discuss: DEV
🔧FPGA
Don't let these 3 CPU specs trick you into paying more
xda-developers.com·23h
🔬RISC-V
Geonum – geometric number library for unlimited dimensions with O(1) complexity
github.com·1d·
Discuss: Hacker News
Shader Programming
Running MiniMax-M2 locally - Existing Hardware Advice
reddit.com·2h·
Discuss: r/LocalLLaMA
📊Performance Tools
How to Design Efficient Memory Architectures for Agentic AI Systems
pub.towardsai.net·50m
🧠Memory Management
Cutting LLM Batch Inference Time in Half: Dynamic Prefix Bucketing at Scale
daft.ai·2h·
Discuss: Hacker News
🧠Memory Management
Low-Level Hacks
blog.raycursive.com·17h·
Discuss: Hacker News
🦀Rust
Extensive FPGA and ASIC resource comparison for blind I/Q imbalance estimators and compensators
sciencedirect.com·4h
🔧FPGA
Can-t stop till you get enough
cant.bearblog.dev·2d·
Discuss: Hacker News
Programming
Big-O Notation: Explained in 8 Minutes
blog.algomaster.io·16h
🔀Parallel Algorithms
Why Multimodal AI Broke the Data Pipeline — And How Daft Is Beating Ray and Spark to Fix It
hackernoon.com·1d
📊Performance Tools
H-FA: A Hybrid Floating-Point and Logarithmic Approach to Hardware Accelerated FlashAttention
arxiv.org·14h
🧠Memory Management
Synopsys and NVIDIA Forge AI Powered Future for Chip Design and Multiphysics Simulation
semiwiki.com·1d
🎮GPU Programming
PAINT25 Invited Talk transcript: “Notational Freedom via Self-Raising Diagrams”
programmingmadecomplicated.wordpress.com·7h
🦀Rust