⚡ Low-latency - lucifer13 · Scour

Scaling llama.cpp On Neoverse N2: Solving Cross-NUMA Performance Issues

semiengineering.com·14h

🚀Performance

PRISM: Parallel Residual Iterative Sequence Model

arxiv.org·17h

⚡SIMD Optimization

datavorous/spheni: An in-memory vector search library in C++ with Python bindings

github.com·1d·

Discuss: Hacker News

⚡SIMD Optimization

Training-Free Real-Time Control for Autoregressive Video Generation

daydream.live·7h·

Discuss: Hacker News

⚡SIMD Optimization

Block encoding of sparse matrices with a periodic diagonal structure

arxiv.org·17h

⚡SIMD Optimization

A RISC-V vector extension primer

blog.adafruit.com·6h

⚡SIMD Optimization

Discussion - Investigation of Single Thread CPU "Thoughput/cycle"

forums.anandtech.com·23h

🖥️CPU Microarchitecture

Supercharging Inference for AI Factories: KV Cache Offload as a Memory-Hierarchy Problem

blog.min.io·7h

🏗️System Design

Two Ways to Move Tensors Without Stopping: Inside vLLM's Async GPU Transfer Patterns

dev.to·1d·

Discuss: DEV

🚀Performance

Show HN: We Made Nasdaq Parsing Even Faster (and More Reliable)

lunyn.com·2h·

Discuss: Hacker News

🚀Performance

borodark/exmc: Probabilistic programming in BEAM

github.com·1d

⚡SIMD Optimization

ianbarber.blog·18h·

Discuss: Hacker News

🚀Performance

Zvec: SQLite-like simplicity in an embedded vector database (By Alibaba)

zvec.org·9h·

Discuss: Hacker News

📊Vector Database

Nvidia’s new technique cuts LLM reasoning costs by 8x without losing accuracy

venturebeat.com·32m

🚀Performance

EyesOff: Why Some Models Quantize Better Than Others

ym2132.github.io·23h·

Discuss: Hacker News

⚡SIMD Optimization

Minimum Energy Per Query

semiengineering.com·14h

⚙️Systems Programming

Floating bus technical guide

k1.spdns.de·7h

Show HN: Latent-k – Persistent dependency map to reduce AI coding token usage

latentk.org·1d·

Discuss: Hacker News

⚙️Systems Programming

Porting an INT8 VHDL CNN from Intel Agilex 3 to Lattice Certus-NX

news.ycombinator.com·9h·

Discuss: Hacker News

🖥️CPU Microarchitecture

Memgraph 3.8 is Out: Atomic GraphRAG + Vector Single Store With Major Performance Upgrades

memgraph.com·4h·

Discuss: Hacker News

🚀Performance

Loading more...