🤖 Qwen - akapaka · Scour

Casual experiment hint that models seem to search for different stuff

🧠LLM Inference

spock.is··Hacker News

john-rocky/apple-silicon-llm-bench: Neutral, reproducible benchmark for local LLMs on Apple Silicon (Mac · iPhone · iPad) — MLX, llama.cpp, CoreML, Apple Foundation Models

🧠LLM Inference Code

github.com··Hacker News

Less-relevant results

Logits as a new monitor for evaluation awareness

lesswrong.com··Hacker News

Aspen: Own your intelligence

🏠Self-Hosting Discussion Tutorial

runonaspen.com··Hacker News

Ask HN: Is it feasible to run a model on device for complete privacy?

🏠Self-Hosting Discussion

news.ycombinator.com··Hacker News

Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%

🧠LLM Inference

zozo123.github.io··Hacker News

dotojr123/open-infro-agentc: Open Infro Agentc - Open-source AI-powered desktop automation agent

🔌Model Context Protocol Code

github.com··Hacker News

Google Gemma 4 12B nearly matches 26B benchmarks — and runs on your laptop

🏠Self-Hosting

thenewstack.io·

Progress: real and Potemkin

⚡Tokio Blog

scottlocklin.wordpress.com··Hacker News

alibaba/open-code-review: Battle-tested at Alibaba's scale. Hybrid architecture code review tool: deterministic pipelines + LLM Agent, precise line-level comments, built-in fine-tuned ruleset (NPE, thread-safety, XSS, SQL injection), OpenAI & Anthropic compatible.

🕸️WebAssembly Code

github.com··Hacker News

Riemann-bench | Surge AI

⚡LLM Quantization

surgehq.ai··Hacker News

OPRD: On-Policy Representation Distillation

🧠LLM Inference Academic

arxiv.org··Hacker News

Show HN: One API Key for 45 AI Models – Pay per Token, OpenAI Compatible

modelhub-api.com··Hacker News

linzhiqiu/t2v_metrics: Evaluating text-to-image/video/3D models with VQAScore

🤖Machine Learning Code

github.com··Hacker News

The Anatomy of a Learning Stall

🔌Model Context Protocol Blog

tagide.com··Lobsters, Hacker News, Hacker News, Hacker News

Words do not have determined meanings

⚡LLM Quantization Discussion

news.ycombinator.com··Hacker News

RecursiveIntell/proveKV: Two-tier, receipted, content-addressed KV-cache pool. fib_k4_n32 cold tier + turbo_8bit hot tier. 18-20% lossless dPPL on real 1.7B LLM. Successor to kv-lossless-11x (archived).

🧠LLM Inference Code

github.com··r/LocalLLaMA

Hacker News Trends: Search Hacker News super fast with Redis

🤖Machine Learning

hackernewstrends.com··Hacker News

The OnlyFans Economy of American AI

🧠LLM Inference Blog

leoveanu.com··Hacker News

Anthropic tops AI Arena rankings as it files for IPO

📊Prometheus News Blog

liveclip.substack.com··Substack

Log in to enable infinite scrolling