🔓 Open Source AI - fediversial · Scour

The latest Gemma 4 models use a training trick to slash their on-device memory footprint

androidauthority.com·

A new chapter of efficient foundation models for medical imaging

🖥️Retro Computing

techcommunity.microsoft.com·

Researchers Build Self-Replicating AI Worm That Operates Entirely on Local, Open-Weight Models

thehackernews.com·

Re-quantizing a local LLM 14x faster by skipping the tensors that didn't change

🖥️Retro Computing News Blog

andreaborio.substack.com · Substack

Local LLMs, Buy a GPU, and the Case for Cognitive Security

🖥️Retro Computing

briefing.forwardfuture.ai·

Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good

🖥️Retro Computing Blog

towardsai.net·

Google releases Gemma 4 12B with encoder-free multimodal architecture

A system programmer’s guide to LLM inference

🦬Emacs Blog

blog.xiangpeng.systems · Hacker News

magenta/magenta-realtime: Magenta RealTime 2: An Open-Weights Live Music Model

♊Gemini Protocol Code

HNSW vs LSH: How Elasticsearch hits 0.99 recall@10 at 15,000 QPS — and what it costs

🖧BSD Blog

local llm on laptop 780M GPU using llama + gemma 4 qat

🦬Emacs Blog

alper.bearblog.dev·

Optimal Post-Training Quantization Scales and Where to Find Them

🔌Single-Board Computers Academic

China's Xiaomi MiMo Is Now 15X Faster Than ChatGPT and Claude (4 minute read)

♊Gemini Protocol News

Build a Medical Report Analyzer on Dedicated Inference with Python

♊Gemini Protocol

digitalocean.com·

"North Mini Code"; open weights, 30B param, Canadian coding model

🔌Single-Board Computers Blog

cohere.com · Hacker News

AI Serving Platform That Adapts to Your Model

🏠Self-Hosting Blog

databricks.com·

DiffusionGemma

🖥️Retro Computing

simonwillison.net·

2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP

🖥️Retro Computing Blog

dnhkng.github.io·

Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information

🖥️Retro Computing

venturebeat.com · Hacker News

WWDC 2026: Foundation Models (& Anarlog)

skushagra.com·
