🤖 Qwen

Native Inference Engine for macOS 14 or newer

Discussed on Hacker News

Lin Junyang AI Lab Closes Round at $2B Valuation

Discussed on r/LocalLLaMA

Anyscale blog posts·

High Performance Distributed Inference with Ray Serve LLM

Covered by Google Cloud Blog

Discussed on Hacker News

Built Uber aggregator that tracks top AI researchers and leaders

Discussed on Hacker News

huggingface.co·

bartowski/command-a-plus-05-2026-GGUF

Covers 4 stories including GitHub here . You can follow the build instructions below as well. Change -DGGML_CUDA=ON to -DGGML_CUDA=OFF if you don't have a GPU or just want CPU inferen...

Discussed on r/LocalLLaMA

Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation

Covered by tldr.tech

Discussed on Hacker News

teachmecoolstuff.com·

Fine Tuning a Tiny Local LLM to Categorize Questions

Discussed on Hacker News and Hacker News

Alex Ellis' Blog·

Local Qwen isn't a worse Opus, it's a different tool

Covered by 4 sources including lemmy.ml, tldr.tech

Discussed on Hacker News, Lobsters, and r/LocalLLaMA

DFlash and Spec V2 Decoding (14 minute read)

Covers 5 stories including Looking for a self-hosted alternative to Modal.com for running ML workloads

Discussed on Hacker News

Brain the Size of a Planet: Are LLMs Thonking too Hard? (30 minute read)

Covers Defense at AI speed: Microsoft’s new multi-model agentic security system tops leading industry benchmark

Covered by tldr.tech

Discussed on Hacker News

Show HN: Alloy – a PyTorch backend and inference engine for Apple Silicon

Discussed on Hacker News

7900XTX 24GB vram, can finally fit Q6K+MTP with Qwen 3.6 27B at 131k context

Discussed on r/LocalLLaMA

Show HN: Evaluating Local LLMs as language translators for my app

Discussed on Hacker News

substack.productmind.co·

The US just treated an LLM as a munition

Covers Statement on the US government directive to suspend access to Fable 5 and Mythos 5

Discussed on Hacker News

autonomy-landing-page.vercel.app·

Show HN: Autonomy – Self-Harness/Self-Directed AI Agent Core Under Development

Discussed on Hacker News

Ag.ide Index, rank, and refactor your repo's worst code

Discussed on Hacker News

hackernoon.com·

How I Built a Pipeline to Restore Old B&W Photos to 4K Color Using Open-Source AI

ahwurm/localharness: Model-agnostic agent harness for local LLMs — configure agents in YAML and run them on your own hardware (vLLM, Ollama, LM Studio, llama.cpp).

Covers uv

Discussed on Hacker News

The AI Conundrum: We are living in highly subsidized, interesting times

Discussed on Hacker News

venturebeat.com·

Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for 1/6th the cost

Covers 8 stories including GLM-5.2 (6 minute read)

Covered by 4 sources including vettedconsumer.com, AI Changes Everything

Log in to enable infinite scrolling