⚡ SIMD Optimization - emschwartz · Scour

Custom Kernels for All from Codex and Claude

huggingface.co·15h

Show HN: We Made Nasdaq Parsing Even Faster (and More Reliable)

lunyn.com·19h·

Discuss: Hacker News

⚡Vectorized Execution

Breaking the Tractability Barrier: A Generic Low-Level Solver for NP-Hard Instances (N=63) on Commodity 64-Bit Silicon

zenodo.org·8h·

Discuss: r/programming

🧮SMT Solvers

Completed Hyperparameter Transfer across Modules, Width, Depth, Batch and Duration

machinelearning.apple.com·15h

🏗️LLM Infrastructure

[Development] 4MB 32-bit SRAM for the MicroMac Performer

68kmla.org·16h

⚙️Mechanical Sympathy

ml-rust/fluxbench: Benchmarking framework with crash isolation, bootstrap statistics, and CI integration

github.com·2h·

Discuss: r/rust

🔬Rust Profiling

5 Days, One GPU Gameboy Swarm

bkase.io·1h·

Discuss: Hacker News

⚙️Mechanical Sympathy

Compiler-Guided Inference-Time Adaptation: Improving GPT-5 Programming Performance in Idris

arxiv.org·10h

⚙Rust Compiler Internals

MiniMaxAI/MiniMax-M2.5

huggingface.co·1h·

Discuss: Hacker News, r/LocalLLaMA

🏆LLM Benchmarking

AMD Video Decode Now Unified Between RadeonSI & RADV Vulkan Video

phoronix.com·21h

⚡Hardware Acceleration

DeepSeek-V3.2 on GB300: Performance Breakthrough

blog.vllm.ai·15h

🏗️LLM Infrastructure

polyrhachis/macrograd: A lightweight autograd engine inspired by PyTorch and micrograd

github.com·1h·

Discuss: Hacker News

KBVQ-MoE: KLT-guided SVD with Bias-Corrected Vector Quantization for MoE Large Language Models

arxiv.org·10h

🎯Vector Quantization

Data Leakage in Machine Learning: Why You Must Split Before Preprocessing

pub.towardsai.net·11h

🛡️AI Security

Index Compression, Query Execution Improvements

marginalia.nu·15h

📇Index Selection

AI captures particle accelerator behavior to optimize machine performance

phys.org·1h

Profiling on Windows: A Short Rant

mropert.github.io·19m·

Discuss: Hacker News

⚡Systems Performance

A stack-buffer-overflow exercise with AddressSanitizer and PostgreSQL

enterprisedb.com·13h·

Discuss: Lobsters, Hacker News

Linux 7.0 MM Changes Bring Some Very Nice Performance Optimizations

phoronix.com·13h

MiniMaxAI MiniMax-M2.5 has 230b parameters and 10b active parameters

openhands.dev·18h·

Discuss: r/LocalLLaMA

🏆LLM Benchmarking

Loading more...