Model Serving

Feeds to Scour
SubscribedAll
Scoured 13 posts in 13.0 ms

huawei-csl/KVarN: KVarN is a native vLLM KV-cache quantization backend for your agents: 3-5x more context, throughput above FP16, and FP16-level accuracy. Calibration-free, one flag.

 🤖AI  Content type: Code
github.com··Hacker News

google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation

 🤖AI

DiffusionGemma: The Developer Guide- Google Developers Blog

 🤖AI  Content type: Blog

DiffusionGemma: 4x Faster Text Generation

 🤖AI  Content type: News  Content type: Blog

MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better

 🤖AI  Content type: News  Content type: Blog

NVIDIA and LG Group Build an AI Factory to Advance Physical AI, Mobility and AI Infrastructure

 🧠Deep Learning  Content type: Blog

Machinic Psychopharmacology: Do LLMs Self-Medicate?

 🤖AI
lesswrong.com··Hacker News

Youssof Altoukhi (@Youssofal_)

 🤖AI
xcancel.com··r/LocalLLaMA

A drop-in replacement chat template for google/gemma-4-31B-it tuned for open-source agentic coding harnesses.

 🐍Programming

sgl-project/sglang-omni: SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models

 🔨LLVM  Content type: Code
github.com·

OpenEnv is now owned by HF, Torch, Prime Intellect, Unsloth, Modal, Mercor, and more! Use it for training agents.

 🤖AI  Content type: Blog

heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.

 🤖AI  Content type: Code
github.com··r/LocalLLaMA

Does anyone know what PCIe mode was used for these benchmarks?

 🤖AI  Content type: Code
github.com··r/LocalLLaMA

No more posts from micaleel's subscribed feeds.

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help