AI new techology

Feeds to Scour
SubscribedAll
Scoured 584 posts in 6.9 ms

Google open-sources speedy DiffusionGemma text diffusion model

 🤖AI
siliconangle.com·

MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent

 🤖AI native  Content type: Blog
bric.pe.kr··DEV

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

 🤖AI

Anthropic backtracks on policy that 'sabotaged' researchers' work

 🤖AI native  Content type: News
engadget.com·

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

 🤖AI  Content type: Blog
adambien.blog·

Anthropic Reverses Course on Hidden AI Restrictions Following Developer Backlash

 🤖AI native
devops.com·

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell

 🤖AI  Content type: News  Content type: Blog
developer.nvidia.com·

Google's DiffusionGemma generates 256 tokens in parallel and self-corrects as it goes

 🤖AI
venturebeat.com·

Discrete Diffusion Modelling by Estimating the Ratios of the Data Distribution

 🤖AI  Content type: News  Content type: Blog

Apple WWDC On-Device AI Deep Dive - Google Docs

 🤖AI
gist.is··Hacker News

AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support

 🤖AI native

2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP

 🤖AI native  Content type: Blog
dnhkng.github.io·

Score-based diffusion models for accurate crystal-structure inpainting and reconstruction of hydrogen positions

 🤖AI  Content type: Academic
nature.com·

#068 - Apple runs Siri on Google's Gemini, OpenAI files a secret IPO at $852B, Xiaomi clocks 1,000 tps

 🤖AI native
indiehacker.news·

massimo92/spark: CLI tool for serving LLMs with vLLM on NVIDIA DGX Spark. One file, zero friction.

 🤖AI native  Content type: Code
github.com··Hacker News

SPEAR: A System for Post-Quantization Error-Adaptive Recovery Enabling Efficient Low-Bit LLM Serving

 🤖AI native  Content type: Academic
arxiv.org·

A system programmer’s guide to LLM inference

 🤖AI  Content type: Blog

Claude Now Writes 80% of Its Own Code — Anthropic's Self-Improvement Milestone Arrives Faster Than Expected

 🤖AI
the-agent-report.com··DEV

DiffusionGemma vs Gemma-4 — Post-OCR Correction

 🤖AI
huggingface.co·

Model2vec-zig: static text embeddings in pure Zig, in a single binary

 🤖AI native
ziggit.dev·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help