LocalLlama · Scour

Bleeding Llama: Critical Unauthenticated Memory Leak in Ollama (CVE-2026–7482)

cyera.com·9h·r/LocalLLaMA, r/netsec

US announces deals with tech firms for national security review of AI models before release

theguardian.com·10h·r/LocalLLaMA

Qwen3.6 27B vs Qwen3.5 27B vs Gemma 4 31B: Accuracy, Latency, Memory, and Token Efficiency Tested

kaitchup.substack.com

·10h·r/LocalLLaMA

ixu2486/tq_compat_eval: Independent TurboQuant-compatible KV backend evaluation SDK for compressed-KV ABI testing, smoke tests, and partial attention decode experiments.

github.com·11h·r/LocalLLaMA

Comparing the best open source TranslateGemma projects

metalglot.com·5d·Hacker News, r/LocalLLaMA

Supercharging LLM inference on Google TPUs: Achieving 3X speedups with diffusion-style speculative decoding

developers.googleblog.com·1d·Hacker News, Hacker News, r/LocalLLaMA

I made a voice controlled Tic-Tac-Toe game as a learning project

github.com·23h·r/LocalLLaMA

Qwen3.6 27B FP8 runs with 200k tokens of BF16 KV cache at 80 TPS on a single RTX 5000 PRO 48GB

huggingface.co·23h·r/LocalLLaMA

Peanut - Text to Image Model (Open Weights coming soon)

xcancel.com·1d·r/LocalLLaMA

[Feature] TurboQuant: support hybrid models and uniform quantization by JartX · Pull Request #39931

github.com·1d·r/LocalLLaMA

White House Considers Vetting A.I. Models Before They Are Released

·1d·Hacker News, r/LocalLLaMA, r/OpenAI, r/singularity

NVIDIA DGX Spark™ + Apple Mac Studio = 4x Faster LLM Inference with EXO 1.0

blog.exolabs.net·28w·Hacker News, Hacker News, Hacker News, r/LocalLLaMA, r/LocalLLaMA

llama + spec: MTP Support by am17an · Pull Request #22673

github.com·1d·r/LocalLLaMA

[Release] TinyMozart v2 85M 🎶

huggingface.co·1d·r/LocalLLaMA

Deep research + report "a la McKinsey" with Hermes Agent and qwen3.6-35b-a3b Q6_K.

github.com·1d·r/LocalLLaMA

50 Prozent mehr Speicher: Ryzen AI Max+ Pro 495 mit Radeon 8065S nutzt 192 GByte RAM

computerbase.de·1d·r/LocalLLaMA

AMD Ryzen AI Max+ PRO 495 leaks out, features Radeon 8065S iGPU and 192GB memory

videocardz.com·2d·r/LocalLLaMA

SearchSavior/Qwen3-TTS-OpenVINO: From scratch qwen3 tts in pytorch, with from scratch openvino implementation on top.

github.com·2d·r/LocalLLaMA

SicariusSicariiStuff/Assistant_Pepe_32B

huggingface.co·2d·r/LocalLLaMA

AMD and Intel Unveil ACE: New matrix instructions deliver a massive 16x AI performance leap over AVX

tweaktown.com·5d·r/LocalLLaMA

Log in to enable infinite scrolling