Qwen

Feeds to Scour
SubscribedAll
Scoured 57 posts in 7.3 ms

Casual experiment hint that models seem to search for different stuff

 🧠LLM Inference
spock.is··Hacker News

john-rocky/apple-silicon-llm-bench: Neutral, reproducible benchmark for local LLMs on Apple Silicon (Mac · iPhone · iPad) — MLX, llama.cpp, CoreML, Apple Foundation Models

 🧠LLM Inference  Content type: Code
github.com··Hacker News
Less-relevant results

Logits as a new monitor for evaluation awareness

 📊Prometheus
lesswrong.com··Hacker News

Aspen: Own your intelligence

 🏠Self-Hosting  Content type: Discussion  Content type: Tutorial

Ask HN: Is it feasible to run a model on device for complete privacy?

 🏠Self-Hosting  Content type: Discussion

Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%

 🧠LLM Inference
zozo123.github.io··Hacker News

dotojr123/open-infro-agentc: Open Infro Agentc - Open-source AI-powered desktop automation agent

 🔌Model Context Protocol  Content type: Code
github.com··Hacker News

Google Gemma 4 12B nearly matches 26B benchmarks — and runs on your laptop

 🏠Self-Hosting
thenewstack.io·

Progress: real and Potemkin

 Tokio  Content type: Blog

alibaba/open-code-review: Battle-tested at Alibaba's scale. Hybrid architecture code review tool: deterministic pipelines + LLM Agent, precise line-level comments, built-in fine-tuned ruleset (NPE, thread-safety, XSS, SQL injection), OpenAI & Anthropic compatible.

 🕸️WebAssembly  Content type: Code
github.com··Hacker News

Riemann-bench | Surge AI

 LLM Quantization
surgehq.ai··Hacker News

OPRD: On-Policy Representation Distillation

 🧠LLM Inference  Content type: Academic
arxiv.org··Hacker News

Show HN: One API Key for 45 AI Models – Pay per Token, OpenAI Compatible

 📊Prometheus

linzhiqiu/t2v_metrics: Evaluating text-to-image/video/3D models with VQAScore

 🤖Machine Learning  Content type: Code
github.com··Hacker News

The Anatomy of a Learning Stall

 🔌Model Context Protocol  Content type: Blog

Words do not have determined meanings

 LLM Quantization  Content type: Discussion

RecursiveIntell/proveKV: Two-tier, receipted, content-addressed KV-cache pool. fib_k4_n32 cold tier + turbo_8bit hot tier. 18-20% lossless dPPL on real 1.7B LLM. Successor to kv-lossless-11x (archived).

 🧠LLM Inference  Content type: Code
github.com··r/LocalLLaMA

Hacker News Trends: Search Hacker News super fast with Redis

 🤖Machine Learning

The OnlyFans Economy of American AI

 🧠LLM Inference  Content type: Blog
leoveanu.com··Hacker News

Anthropic tops AI Arena rankings as it files for IPO

 📊Prometheus  Content type: News  Content type: Blog

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help