🚀 Performance - hugonoss · Scour

ExaBench: An Open Database Performance Leaderboard 🧮Vector Databases

exasol.com·1d·Hacker News

[WIP] Benchmarking Local LLMs Against Coding Agent Harnesses 🦙Ollama

neuralnoise.com·3d·Hacker News

Granite 4.1: IBM's 8B Model Is Competing With Models Four Times Its Size 🦙Ollama

firethering.com·21h·Hacker News

Utilyze measures how efficiently your GPU is doing useful work ⚙Laptop optimization

github.com·13h·Hacker News

TurboQuant on a MacBook Pro, part 2: perplexity, KL divergence, and asymmetric K/V on M5 Max ⬛Ditherpunk

llmkube.com·2d·r/LocalLLaMA

DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles 🧮Vector Databases

lmsys.org·5d·Hacker News

Openweight Benchmark 🧮Vector Databases

openweightbench.pages.dev·14h·Hacker News

KV Cache Locality: The Hidden Variable in Your LLM Serving Cost ⚙Laptop optimization

ranvier.systems·1d·Hacker News

Issue 649 💡New and interesting problems

datascienceweekly.substack.com·10h·Substack

Odysseys: Benchmarking Web Agents on Realistic Long Horizon Tasks 🦙Ollama

odysseys-website.pages.dev·1d·Hacker News

Show HN: Utilyze, an open source GPU monitoring tool more accurate than nvtop ⚙Laptop optimization

systalyze.com·3d·Hacker News

PEAKS No 42: The Open-Weight Uprising: GPT-5.5, Qwen Beats a 397B Giant, and Your Jira Data Is Now AI Training Fuel 🦙Ollama

bogdandeac.com·2d

Vibing, Harness and OODA loop 🦙Ollama

architecture-weekly.com·4d

Show HN: 1990s Game Dev Algorithms for Distributed Systems 🦙Ollama

docs.merca.earth·2d·Hacker News

GPT-5.5: Capabilities and Reactions 🦙Ollama

thezvi.wordpress.com·2d

Introducing SOB: A Multi-Source Structured Output Benchmark for LLMs 🦙Ollama

interfaze.ai·3d·Hacker News

Reaching SOTA Without Breaking the Bank: Using AI21 Maestro to optimize deep research agents 🦙Ollama

ai21.com·2d·Hacker News

Reimagining Kernel Generation at the PTX Layer: An LLM System Learning from DSLs to Outperform Them 🦙Ollama

standardkernel.com·3d·Hacker News

Containerized data centers help avoid many pitfalls in AI deployments ⌨️Cyberdeck Building

techzine.eu·2d

AI evals are becoming the new compute bottleneck 🦙Ollama

huggingface.co·1d·Hacker News

Log in to enable infinite scrolling