Vector Instructions, AVX, SSE, Parallel Processing, Performance

Geonum – geometric number library for unlimited dimensions with O(1) complexity
github.com·1d·
Discuss: Hacker News
Shader Programming
Flag this post
A hitchhiker's guide to CUDA programming
seanzhang.me·4d·
Discuss: Hacker News
🎮GPU Programming
Flag this post
Programming for Computations: Matlab/Octave
link.springer.com·1d·
Discuss: Hacker News
🎮GPU Programming
Flag this post
Synopsys and NVIDIA Forge AI Powered Future for Chip Design and Multiphysics Simulation
semiwiki.com·1d
🎮GPU Programming
Flag this post
A Friendly Tour of Process Memory on Linux
0xkato.xyz·17h·
Discuss: Hacker News
🧠Memory Management
Flag this post
Some Fun Videos on Optimizing NES Code
bumbershootsoft.wordpress.com·2d
🧠Memory Management
Flag this post
Free Functions Don't Change Performance (Much)
16bpp.net·1d·
Discuss: Hacker News, r/cpp
🦀Rust
Flag this post
Limitations of a two-pass assembler
boston.conman.org·13h
🔩Assembly
Flag this post
Detailed Technical Documentation on AI Implementation Logic (Taking Large Language Models as an Example )
nbtab.com·7h·
Discuss: DEV
🧠Memory Management
Flag this post
Playing Around with ARM Assembly
blog.nobaralabs.com·12h·
Discuss: Hacker News
🔩Assembly
Flag this post
We found embedding indexing bottleneck in the least expected place: JSON parsing
nixiesearch.substack.com·1d·
Discuss: Substack
🦀Rust
Flag this post
Inline vs. Pipeline Ray Tracing
evolvebenchmark.com·2h·
Discuss: Hacker News
🌟Ray Tracing
Flag this post
GDM: Consistency Training Helps Limit Sycophancy and Jailbreaks in Gemini 2.5 Flash
lesswrong.com·14m
Shader Programming
Flag this post
How to build a Heapless Vector using `MaybeUninit<T>` for Better Performance.
dev.to·4h·
Discuss: DEV
🦀Rust
Flag this post
Predicting & Mitigating Data Corruption in Pure Storage Flash Arrays via Adaptive Bit Error Rate Modeling
dev.to·5h·
Discuss: DEV
🔧FPGA
Flag this post
Dive into Systems
diveintosystems.org·23h·
Discuss: Hacker News
🖥️Operating Systems
Flag this post
How NVIDIA GeForce RTX GPUs Power Modern Creative Workflows
blogs.nvidia.com·2h
🎮Game Engines
Flag this post
Co-Simulation Framework for Parallel DNN Execution on Chiplet-Based Systems (UW–Madison, Washington State)
semiengineering.com·19h
🎮GPU Programming
Flag this post
flowengineR: A Modular and Extensible Framework for Fair and Reproducible Workflow Design in R
arxiv.org·11h
🦀Rust
Flag this post