Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMs
🧠 LLMs
Specific
large language models, GPT, transformers, fine-tuning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
226
posts in
13.9
ms
Sparse
Mixture-of-Experts
Reward
Models
Learn Interpretable and Specialized
Experts
for Personalized Preference Modeling
🗄️
Vector Databases
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Sparse Mixture-of-Experts Reward Models Learn Interpretable and Specialized Experts for Personalized Preference Modeling
A system programmer’s guide to
LLM
inference
⚙️
Systems Programming
Content type:
Blog
blog.xiangpeng.systems
·
2d
2 days ago
·
Hacker News
Actions for A system programmer’s guide to LLM inference
LeLab Is
Hugging
Face
’s New Browser-Based GUI for the LeRobot Ecosystem
🌐
Open Source
Content type:
News
hackster.io
·
20h
20 hours ago
Actions for LeLab Is Hugging Face’s New Browser-Based GUI for the LeRobot Ecosystem
magenta/magenta-realtime: Magenta RealTime 2: An Open-Weights Live Music
Model
💾
Storage Engines
Content type:
Code
github.com
·
5h
5 hours ago
Actions for magenta/magenta-realtime: Magenta RealTime 2: An Open-Weights Live Music Model
What's in the Box? A Field Guide to AI
Models
🤖
AI Agents
Content type:
Blog
iankduncan.com
·
1d
1 day ago
Actions for What's in the Box? A Field Guide to AI Models
Build a Medical Report Analyzer on Dedicated
Inference
with Python
⚙️
Systems Programming
digitalocean.com
·
6d
6 days ago
Actions for Build a Medical Report Analyzer on Dedicated Inference with Python
Microsoft just shared the frontier data
engineering
secrets
🤖
AI Agents
mail.bycloud.ai
·
16h
16 hours ago
Actions for Microsoft just shared the frontier data engineering secrets
Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech
🤖
AI Agents
Content type:
Blog
huggingface.co
·
16h
16 hours ago
Actions for Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech
Xiaomi
MiMo-V2.5-Pro
Just Hit 1,000 Tokens Per Second!
🔗
Networking
gizchina.com
·
1d
1 day ago
Actions for Xiaomi MiMo-V2.5-Pro Just Hit 1,000 Tokens Per Second!
Nvidia Nemotron 3 Ultra
🤖
AI Agents
research.nvidia.com
·
6d
6 days ago
·
Hacker News
Actions for Nvidia Nemotron 3 Ultra
Cohere open-sources a coding agent that runs on a single H100
🤖
AI Agents
venturebeat.com
·
14h
14 hours ago
Actions for Cohere open-sources a coding agent that runs on a single H100
Location: Göttingen, Germany Remote: Yes (preferred; hybrid also
fine
) Willing t...
🤖
AI Agents
Content type:
Discussion
news.ycombinator.com
·
6d
6 days ago
·
Hacker News
Actions for Location: Göttingen, Germany Remote: Yes (preferred; hybrid also fine) Willing t...
Google Gemma 4 12B brings native multimodal AI to standard laptops
🤖
AI Agents
4sysops.com
·
1d
1 day ago
Actions for Google Gemma 4 12B brings native multimodal AI to standard laptops
Less-relevant results
Google fills out the middle with the Gemma 4 12B
🤖
AI Agents
jonpeddie.com
·
21h
21 hours ago
Actions for Google fills out the middle with the Gemma 4 12B
Running
LLM
Inference
on Kubernetes: What It Actually Takes
🖥️
OS
Content type:
Blog
fairwinds.com
·
4d
4 days ago
Actions for Running LLM Inference on Kubernetes: What It Actually Takes
China's Xiaomi MiMo Is Now 15X Faster Than ChatGPT and Claude (4 minute read)
🔗
Networking
Content type:
News
decrypt.co
·
1d
1 day ago
Actions for China's Xiaomi MiMo Is Now 15X Faster Than ChatGPT and Claude (4 minute read)
LLM
Research Papers: The 2026 List (January to May)
🤖
AI Agents
Content type:
News
magazine.sebastianraschka.com
·
4d
4 days ago
·
Hacker News
Actions for LLM Research Papers: The 2026 List (January to May)
defai-digital/ax-engine
: Apple Silicon
LLM
runtime supporting Gemma 4 and Qwen 3.6 MTP
modes
🤖
AI Agents
Content type:
Code
github.com
·
11h
11 hours ago
·
Hacker News
Actions for defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes
"North Mini Code"; open weights, 30B param, Canadian coding
model
🤖
AI Agents
Content type:
Blog
cohere.com
·
1d
1 day ago
·
Hacker News
Actions for "North Mini Code"; open weights, 30B param, Canadian coding model
Running Qwen 35B MoE at 450k Context on a Single 32GB GPU
🖥️
OS
local-llm.utop.workers.dev
·
3d
3 days ago
·
Hacker News
Actions for Running Qwen 35B MoE at 450k Context on a Single 32GB GPU
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help