inference

Feeds to Scour
SubscribedAll
Scoured 161 posts in 6.1 ms

DiffusionGemma: 4x Faster Text Generation

 🤖AI  Content type: News  Content type: Blog

google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation

 🤖AI

Two Leaps to 1000 Tokens/s on a 1T-Parameter Model: On Inference Systems, Execution Boundaries, and Co-Design

 🤖AI  Content type: Blog
tilert.ai··Hacker News

PoQ-Judge: A Multi-Architecture Evaluation Framework for Cost-Aware Proof-of-Quality in Decentralized LLM Inference

 🤖AI  Content type: Academic
arxiv.org·

Redis vs Memorystore: key differences in 2026

 🤖AI  Content type: Blog
redis.io·

Autonomous AI worm uses local models to exploit networks and repair its own code

 🤖AI
4sysops.com·

I Processed 2.4 Billion Tokens Across 52 AI Models for $0.52. Here's the Full Breakdown.

 🤖AI
saintlex.sbs··DEV

🇳🇱 Go/Golang job: Senior Backend Engineer (Go) | Studio AI at Creative Fabrica (Amsterdam, Netherlands)

 🤖AI
golangprojects.com·

Why I care so much about energy per token

 🤖AI  Content type: Blog
ziraph.com··Hacker News

The Death of the Four Golden Signals: Designing Telemetry for Non-Deterministic Infrastructure

 🤖AI
devops.com·

AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support

 🤖AI

[AINews] Open Models, Model Labs vs Agent Labs, and What's Untrainable — Sarah Guo

 🤖AI  Content type: News
latent.space
·

Rate Limits & Anti-Bots in Agentic Scraping

 🤖AI
alterlab.io··DEV

Intro — Sehastrajit

 🤖AI  Content type: Blog
medium.com·

[AINews] FrontierCode: Benchmarking for Code Quality over Slop

 🤖AI  Content type: News
latent.space
·

Running Qwen 35B MoE at 450k Context on a Single 32GB GPU

 🤖AI

Nvidia DGX Spark GB10 – AI Models and Guide with vLLM and Autonomous Script

 🤖AI  Content type: Code
github.com··Hacker News

What Arm-based innovations happened in May 2026?

 🤖AI  Content type: Blog
newsroom.arm.com·

The 1-Second Timeout Hack: Running Infinite Parallel Workloads Natively on Google Apps Script

 🤖AI  Content type: Blog
medium.com
·

Alignment Collapse Under KV Cache Quantization: Diagnosis and Mitigation

 🤖AI  Content type: Academic
arxiv.org·
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help