Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
artificial intelligence
🤖 artificial intelligence
Broad
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
215
posts in
7.4
ms
On-device AI is a margin decision
⚙️
AI Automation
Content type:
Blog
ziraph.com
·
16h
16 hours ago
·
Hacker News
Actions for On-device AI is a margin decision
A system programmer’s guide to LLM
inference
🧠
LLMs
Content type:
Blog
blog.xiangpeng.systems
·
3d
3 days ago
·
Hacker News
Actions for A system programmer’s guide to LLM inference
HNSW vs LSH: How Elasticsearch hits 0.99 recall@10 at 15,000 QPS — and what it costs
🔬
AI Research Tools
Content type:
Blog
elastic.co
·
2d
2 days ago
Actions for HNSW vs LSH: How Elasticsearch hits 0.99 recall@10 at 15,000 QPS — and what it costs
Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent
✍️
Prompt Engineering
Content type:
News
spectrum.ieee.org
·
23h
23 hours ago
·
Hacker News
Actions for Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent
China's Xiaomi MiMo Is Now 15X Faster Than ChatGPT and Claude (4 minute read)
📰
AI News
Content type:
News
decrypt.co
·
2d
2 days ago
·
Hacker News
Actions for China's Xiaomi MiMo Is Now 15X Faster Than ChatGPT and Claude (4 minute read)
Magenta RealTime 2: Open and Local Live Music
Models
🎨
AI for Creators
magenta.withgoogle.com
·
6d
6 days ago
·
Hacker News
,
Hacker News
,
r/LocalLLaMA
Actions for Magenta RealTime 2: Open and Local Live Music Models
Loss Landscape Diagnosis for Gradient-Based Gray-Scott System Inversion: Disentangling the Roles of PINN Components
🤖
AI Agents
Content type:
Academic
arxiv.org
·
6h
6 hours ago
Actions for Loss Landscape Diagnosis for Gradient-Based Gray-Scott System Inversion: Disentangling the Roles of PINN Components
Xiaomi MiMo-V2.5-Pro Just Hit 1,000 Tokens Per Second!
🎼
Agent Orchestration
gizchina.com
·
2d
2 days ago
Actions for Xiaomi MiMo-V2.5-Pro Just Hit 1,000 Tokens Per Second!
harshuljain13/llm-inference-at-scale
: A Practitioner handbook for production llm serving.
🧠
LLMs
Content type:
Code
github.com
·
4d
4 days ago
·
Hacker News
,
r/LLM
Actions for harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
Two Leaps to 1000 Tokens/s on a 1T-Parameter
Model
: On
Inference
Systems, Execution Boundaries, and Co-Design
⚙️
AI Automation
Content type:
Blog
tilert.ai
·
2d
2 days ago
·
Hacker News
Actions for Two Leaps to 1000 Tokens/s on a 1T-Parameter Model: On Inference Systems, Execution Boundaries, and Co-Design
TurboQuant in PostgreSQL
⚛️
Quantum Computing
Content type:
Blog
blog.mayflower.de
·
2h
2 hours ago
Actions for TurboQuant in PostgreSQL
Re-quantizing
a local LLM 14x faster by skipping the tensors that didn't change
🧠
LLMs
Content type:
News
Content type:
Blog
andreaborio.substack.com
·
22h
22 hours ago
·
Substack
Actions for Re-quantizing a local LLM 14x faster by skipping the tensors that didn't change
The latest Gemma 4
models
use a training trick to slash their on-device memory footprint
🧠
LLMs
androidauthority.com
·
5d
5 days ago
Actions for The latest Gemma 4 models use a training trick to slash their on-device memory footprint
Alduin 4B, an uncensored Vision LLm just released.
🦀
openclaw
huggingface.co
·
12h
12 hours ago
·
r/StableDiffusion
Actions for Alduin 4B, an uncensored Vision LLm just released.
The Smallest Brain You Can Build: A Perceptron in Python
🧠
LLMs
Content type:
Discussion
news.ycombinator.com
·
2d
2 days ago
·
Hacker News
Actions for The Smallest Brain You Can Build: A Perceptron in Python
MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better
🧠
LLMs
Content type:
News
Content type:
Blog
kaitchup.substack.com
·
5d
5 days ago
·
r/LocalLLaMA
Actions for MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better
The week AI infrastructure crossed from a technology story to a
financial
one
🤖
claude code
Content type:
News
mlwhiz.com
·
10h
10 hours ago
Actions for The week AI infrastructure crossed from a technology story to a financial one
Train
Models
Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell
🧠
LLMs
Content type:
News
Content type:
Blog
developer.nvidia.com
·
2d
2 days ago
Actions for Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell
Machine
Learning
With Manya: The Great Toy Shop Whisper Game
📦
AI Product Launches
Content type:
Blog
medium.com
·
5d
5 days ago
Actions for Machine Learning With Manya: The Great Toy Shop Whisper Game
Generalizable self-supervised
learning
for imaging flow cytometry on multi-dataset leukocyte differential
⚙️
AI Automation
Content type:
Academic
nature.com
·
1d
1 day ago
Actions for Generalizable self-supervised learning for imaging flow cytometry on multi-dataset leukocyte differential
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help