Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🚀 Model Serving
TorchServe, TensorFlow Serving, Inference Optimization, Batching
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
958
posts in
40.4
ms
ggml
: backend-agnostic tensor parallelism by
JohannesGaessler
· Pull Request #19378
github.com
·
1d
·
Discuss:
r/LocalLLaMA
🧮
Vector Databases
[
RFC
PATCH v1 0/4] Machine Learning (
ML
) library in Linux kernel
lore.kernel.org
·
13h
·
Discuss:
Lobsters
,
Hacker News
🔨
LLVM
Optimized
LLM Inference
Engines
rishirajacharya.com
·
2d
🔨
LLVM
Sequential Attention: Making AI models
leaner
and faster without
sacrificing
accuracy
research.google
·
2d
·
Discuss:
Hacker News
,
r/LocalLLaMA
🤖
Machine Learning
Generative
Pen-Trained
Transformer
theodore.net
·
1d
·
Discuss:
Hacker News
📓
Jupyter Notebooks
Humane
, adaptive AI
bootstrapping
natemeyvis.com
·
19h
🤖
Machine Learning
Agent Economics: a
BOTEC
on
feasibility
lesswrong.com
·
1d
🤖
Transformers
Text classification with Python 3.14's
zstd
module • Max
Halford
maxhalford.github.io
·
1d
·
Discuss:
Lobsters
,
Hacker News
🤖
Transformers
Released:
DeepBrainz-R1
— reasoning-first small models for agentic workflows (
4B
/ 2B
huggingface.co
·
1d
·
Discuss:
Hacker News
,
r/LocalLLaMA
🧠
Deep Learning
ahead-of-time wasm
gc
in
wastrel
wingolog.org
·
17h
·
Discuss:
Lobsters
,
Hacker News
🔨
LLVM
FAR
Labs
Launches Distributed
Compute
Network for AI Inference
hackernoon.com
·
2d
🧠
Deep Learning
Replatforming
Heroku
’s Runtime to Kubernetes: Inside Fir
timlawrenceportfolio.com
·
1d
🔄
Concurrency
felt
the
flow
while programming
skuka.online
·
1d
🔄
Concurrency
Agentic
Proof-Oriented
Programming
risemsr.github.io
·
1d
·
Discuss:
Lobsters
,
Hacker News
🐍
Programming
Yet Another Blog Post About
Programming
With AI
amykhar.bearblog.dev
·
19h
🤖
Transformers
Self-Attention at Constant Cost per Token via
Symmetry-Aware
Taylor
Approximation
arxiv.org
·
4d
·
Discuss:
Hacker News
🤖
Transformers
AI
Grid
: Run LLMs in Your Browser, Share GPU
Compute
with the World
webgpu.com
·
2d
·
Discuss:
r/LocalLLaMA
🧠
Deep Learning
Same Image,
Different
Score
?
halide.cx
·
2d
·
Discuss:
Lobsters
,
Hacker News
🧮
Vector Databases
My AI
Adoption
Journey
mitchellh.com
·
2d
·
Discuss:
Lobsters
,
Hacker News
,
r/LocalLLaMA
🔄
Concurrency
camel-ai/seta-env
: 💻
SETA
: Scaling Environments for Terminal Agents - Environments
github.com
·
20h
·
Discuss:
r/LocalLLaMA
🤖
AI
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help