Fast Matrix Multiply on an Apple GPU
percisely.xyz·3d
SIMD Optimization
An enough week
blog.mitrichev.ch·1d
🧮Z3 Solver
GCC Patches Posted For C++26 SIMD Support
phoronix.com·21h
🔩Systems Programming
Real-Time Adaptive Sparsity Optimization for Edge-Deployed AI Inference Accelerators
dev.to·21h·
Discuss: DEV
🌊Streaming Compression
Can an LLM Be a Black-Box Optimizer?
posgeo.wordpress.com·9m·
Discuss: Hacker News
🧮Kolmogorov Bounds
Randomized and quantum approximate matrix multiplication
arxiv.org·1d
🔐Quantum Cryptography
Can AI Co-Design Distributed Systems? Scaling from 1 GPU to 1k
harvard-edge.github.io·9h·
Discuss: Hacker News
🎯Performance Proofs
Multi-Core By Default
rfleury.com·1d
🔩Systems Programming
Explicit Lossless Vertex Expanders!
gilkalai.wordpress.com·21h
💎Information Crystallography
Vector Databases - a benchmark
seanpedersen.github.io·3d·
Discuss: Hacker News
🗂️Vector Databases
Leveraging Normalizing Flows for Conservative 6D Beam Reconstruction: Conclusions and Extensions
hackernoon.com·2d
🏺Computational Archaeology
Just shipped Shimmy v1.7.0: Run 42B models on your gaming GPU!
reddit.com·2d·
Discuss: r/rust
🖥️Terminal Renaissance
Automated Anomaly Detection in Time-Series Statistical Spreadsheets via Hyperdimensional Vector Similarity
dev.to·11h·
Discuss: DEV
🔤Character Classification
The Bit Shift Paradox: How "Optimizing" Can Make Code 6× Slower
hackernoon.com·3d
🧮Compute Optimization
Enhancing Vector Signal Generator Accuracy with Adaptive Polynomial Regression Calibration
dev.to·19h·
Discuss: DEV
📡Audio Modulation
Trillion-Scale Goldbach Verification on Consumer Hardware -novel Algorithm [pdf]
zenodo.org·1d·
Discuss: Hacker News
🔢Reed-Solomon Math
Parallelizing Cellular Automata with WebGPU Compute Shaders
vectrx.substack.com·22h·
Discuss: Substack
🔲Cellular Automata
GoMem is a high-performance memory allocator library for Go
github.com·1d
🧠Memory Allocators
BQN "Macros" with •Decompose (2023)
saltysylvi.github.io·9h·
Discuss: Hacker News
🦀Rust Macros