🚀 Performance - hugonoss · Scour

ExaBench: An Open Database Performance Leaderboard 🧮Vector Databases

exasol.com·1d·Hacker News

Fourth Data Prefetching Championship: Part I ⚙Laptop optimization

sigarch.org·3d

RuC: HDL-Agnostic Rule Completion Benchmark Generation 🔗RAG

Optimization vs. Architecture: Knowing the Difference 🧮Vector Databases

tigerdata.com·2d

MauroCE/m3serve: Optimised BAAI/bge-m3 serving with dense + sparse + ColBERT embeddings, async dynamic batching and pipeline GPU inference 🧮Vector Databases

github.com·4d·r/SideProject

atomic_queue benchmarks SMT vs no-SMT performance ⬛Ditherpunk

max0x7ba.github.io·2d·r/cpp, r/linux

Announcing Arm Performix: Empowering developers with scalable performance for the age of AI agents 🦙Ollama

newsroom.arm.com·2d·Hacker News

[WIP] Benchmarking Local LLMs Against Coding Agent Harnesses 🦙Ollama

neuralnoise.com·3d·Hacker News

TurboQuant on a MacBook Pro, part 2: perplexity, KL divergence, and asymmetric K/V on M5 Max ⬛Ditherpunk

llmkube.com·2d·r/LocalLLaMA

DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles 🧮Vector Databases

lmsys.org·5d·Hacker News

How we built the most performant DeepSeek V3.2, MiniMax-M2.5 and Qwen 3.5 397B on DigitalOcean NVIDIA HGX™ B300 GPU Droplets 🦙Ollama

digitalocean.com·3d

Show HN: Utilyze, an open source GPU monitoring tool more accurate than nvtop ⚙Laptop optimization

systalyze.com·3d·Hacker News

Vibing, Harness and OODA loop 🦙Ollama

architecture-weekly.com·4d

What 2x GH200 delivers: memory paths for LLM inference 💫slick production values

dnhkng.github.io·6d

Introducing SOB: A Multi-Source Structured Output Benchmark for LLMs 🦙Ollama

interfaze.ai·3d·Hacker News

FineState-Bench: Benchmarking State-Conditioned Grounding for Fine-grained GUI State Setting ⬛Ditherpunk

Lambda Calculus Benchmark for AI 🦙Ollama

victortaelin.github.io·6d·Hacker News

70x faster cold(ish) starts for SGLang 💫slick production values

fergusfinn.com·6d·Hacker News

Reimagining Kernel Generation at the PTX Layer: An LLM System Learning from DSLs to Outperform Them 🦙Ollama

standardkernel.com·3d·Hacker News

Optimize Anything with LLMs 🦙Ollama

gepa-ai.github.io·6d

Log in to enable infinite scrolling