🤖 AI new techology - Josie · Scour

Google open-sources speedy DiffusionGemma text diffusion model

siliconangle.com·

MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent

🤖AI native Blog

bric.pe.kr··DEV

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

vettedconsumer.com··Hacker News

Anthropic backtracks on policy that 'sabotaged' researchers' work

🤖AI native News

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

🤖AI Blog

adambien.blog·

Anthropic Reverses Course on Hidden AI Restrictions Following Developer Backlash

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell

🤖AI News Blog

developer.nvidia.com·

Google's DiffusionGemma generates 256 tokens in parallel and self-corrects as it goes

venturebeat.com·

Discrete Diffusion Modelling by Estimating the Ratios of the Data Distribution

🤖AI News Blog

leetarxiv.substack.com··Substack, r/programming

Apple WWDC On-Device AI Deep Dive - Google Docs

gist.is··Hacker News

AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support

phoronix.com··r/artificial

2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP

🤖AI native Blog

dnhkng.github.io·

Score-based diffusion models for accurate crystal-structure inpainting and reconstruction of hydrogen positions

🤖AI Academic

#068 - Apple runs Siri on Google's Gemini, OpenAI files a secret IPO at $852B, Xiaomi clocks 1,000 tps

indiehacker.news·

massimo92/spark: CLI tool for serving LLMs with vLLM on NVIDIA DGX Spark. One file, zero friction.

🤖AI native Code

github.com··Hacker News

SPEAR: A System for Post-Quantization Error-Adaptive Recovery Enabling Efficient Low-Bit LLM Serving

🤖AI native Academic

A system programmer’s guide to LLM inference

🤖AI Blog

blog.xiangpeng.systems··Hacker News

Claude Now Writes 80% of Its Own Code — Anthropic's Self-Improvement Milestone Arrives Faster Than Expected

the-agent-report.com··DEV

DiffusionGemma vs Gemma-4 — Post-OCR Correction

huggingface.co·

Model2vec-zig: static text embeddings in pure Zig, in a single binary

Log in to enable infinite scrolling