๐Ÿฟ๏ธ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
๐Ÿ”„ SIMD Programming

AVX512, Vector Instructions, Loop Unrolling, Auto-vectorization

Compress Vectors by 4x by using 8-bit Rotational Quantization
weaviate.ioยท19hยท
Discuss: Hacker News
๐Ÿ—œ๏ธVector Compression
Why I Ditched Malloc for AI Inference
gilli.devยท6hยท
Discuss: Hacker News
๐Ÿง Memory Allocation Strategies
A Smarter Path To Chiplets Through An Enhanced Multi-Die Solution
semiengineering.comยท19h
๐Ÿ”ฌChip Fabrication
MAPo : Motion-Aware Partitioning of Deformable 3D Gaussian Splatting for High-Fidelity Dynamic Scene Reconstruction
arxiv.orgยท22h
โšกHardware Acceleration
Next-generation graph computing with electric current-based and quantum-inspired approaches
nature.comยท16h
โšกHardware Acceleration
GPUPrefixSums โ€“ state of the art GPU prefix sum algorithms
github.comยท14hยท
Discuss: Hacker News
โš™๏ธMechanical Sympathy
the annotated transformer (2022) | hacker news
news.ycombinator.comยท20hยท
Discuss: Hacker News
๐Ÿง LLM Inference
Iโ€™ve been working on something new:
threadreaderapp.comยท13h
๐Ÿช„Prompt Engineering
AMD MI300X for LLM Serving Disaggregating Prefill and Decode with SGLang
rocm.blogs.amd.comยท2hยท
Discuss: Hacker News
๐Ÿ’พPrompt Caching
Compilation vs. vectorization, search engine edition
jpountz.github.ioยท13hยท
Discuss: Hacker News
๐Ÿ”Query Optimization
Gpt-oss Fine-tuning - now with 60K context length and fits on <13GB VRAM
reddit.comยท8hยท
Discuss: r/LocalLLaMA
๐Ÿ“ŠModel Serving Economics
Notes on Programming in C by Rob Pike
lysator.liu.seยท1hยท
Discuss: Hacker News
๐Ÿ’ปProgramming languages
Byte Tank - Pedro Lopes Blog
lopespm.comยท1h
๐Ÿช„Prompt Engineering
Speeding up Firefox Local AI Runtime
blog.mozilla.orgยท9hยท
Discuss: Hacker News
โšกHardware Acceleration
The (Data) Plot Thickens
hackaday.comยท18h
๐Ÿ“ŠVector Databases
AMD details how it built a product line-up with just two RDNA 4 dies โ€” Flexible design and asymmetric harvesting enables production of multiple models without n...
tomshardware.comยท14h
๐Ÿ–ฅGPUs
From Black Box to Blueprint
martinfowler.comยท15h
๐Ÿ”Binary Analysis
What I learned vibe coding a WASM CSV Parser
importcsv.comยท7hยท
Discuss: Hacker News
๐ŸŒฟLeptos
Explaining the Need for Strongly Happens Before in C++
nekrozqliphort.github.ioยท23hยท
Discuss: Hacker News, r/cpp
๐Ÿ”„Cache Coherence
Let's Build a Hypervisor with KVM
evilcookie.deยท14hยท
Discuss: Hacker News
๐Ÿ’ซIO_uring
Loading...Loading more...
AboutBlogChangelogRoadmap