LLMs

Sparse Mixture-of-Experts Reward Models Learn Interpretable and Specialized Experts for Personalized Preference Modeling

 🗄️Vector Databases  Content type: Academic
arxiv.org

A system programmer’s guide to LLM inference

 ⚙️Systems Programming  Content type: Blog

LeLab Is Hugging Face’s New Browser-Based GUI for the LeRobot Ecosystem

 🌐Open Source  Content type: News
hackster.io

magenta/magenta-realtime: Magenta RealTime 2: An Open-Weights Live Music Model

 💾Storage Engines  Content type: Code
github.com

What's in the Box? A Field Guide to AI Models

 🤖AI Agents  Content type: Blog
iankduncan.com

Build a Medical Report Analyzer on Dedicated Inference with Python

 ⚙️Systems Programming
digitalocean.com

Microsoft just shared the frontier data engineering secrets

 🤖AI Agents
mail.bycloud.ai

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

 🤖AI Agents  Content type: Blog
huggingface.co

Xiaomi MiMo-V2.5-Pro Just Hit 1,000 Tokens Per Second!

 🔗Networking
gizchina.com

Nvidia Nemotron 3 Ultra

 🤖AI Agents

Cohere open-sources a coding agent that runs on a single H100

 🤖AI Agents
venturebeat.com

Location: Göttingen, Germany Remote: Yes (preferred; hybrid also fine) Willing t...

 🤖AI Agents  Content type: Discussion

Google Gemma 4 12B brings native multimodal AI to standard laptops

 🤖AI Agents
4sysops.com
Less-relevant results

Google fills out the middle with the Gemma 4 12B

 🤖AI Agents
jonpeddie.com

Running LLM Inference on Kubernetes: What It Actually Takes

 🖥️OS  Content type: Blog
fairwinds.com

China's Xiaomi MiMo Is Now 15X Faster Than ChatGPT and Claude (4 minute read)

 🔗Networking  Content type: News
decrypt.co

LLM Research Papers: The 2026 List (January to May)

 🤖AI Agents  Content type: News

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

 🤖AI Agents  Content type: Code
github.com · Hacker News

"North Mini Code"; open weights, 30B param, Canadian coding model

 🤖AI Agents  Content type: Blog
cohere.com · Hacker News

Running Qwen 35B MoE at 450k Context on a Single 32GB GPU

 🖥️OS
