Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🤖 LLM Inference
Specific
Model Serving, Quantization, vLLM, ONNX Runtime
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
192
posts in
13.8
ms
.txt - Structured Outputs for Production LLMs
🧠
LLMs
dottxt.ai
·
6d
LFM2.5-8B-A1B: An Even Better On-Device Mixture-of-Experts
📡
Edge AI
liquid.ai
·
1d
·
Hacker News
,
Hacker News
Llama.cpp
now has an official website:
llama.app
🪟
Tauri
llama.app
·
3h
·
Hacker News
Fusion Funding, AI Export Risks, & GitHub Botnets
📡
Edge AI
briefing.forwardfuture.ai
·
1d
WarpSpeed approaches Speed of Light on Blackwell
⏱️
Latency Engineering
doubleai.com
·
5d
·
Hacker News
,
Hacker News
Delayed
Tensor
Parallelism for Faster Transformer
Inference
🔮
Speculative Decoding
blog.kog.ai
·
1d
·
Hacker News
MobileNet vs EfficientNet-Lite: CIFAR-10 Accuracy for Beginners
📡
Edge AI
tildalice.io
·
1d
How to route external and local LLMs with
Models-as-a-Service
⚙️
MLOps
developers.redhat.com
·
4d
CVE-2026-48710 Starlette Host-Header Auth Bypass
🛡️
Memory Safety
badhost.org
·
2d
·
Lobsters
Structured
LLM
Outputs
🧠
LLMs
dottxt-ai.github.io
·
3d
·
Hacker News
Chat with Large Language
Models
• ellmer
🔮
Speculative Decoding
ellmer.tidyverse.org
·
14h
AI Learning Roadmap: Where to Start if You're a Complete Beginner
📡
Edge AI
fondralabs.com
·
2d
·
DEV
MMBT-Messy-Model-Bench-Tests/hardware-tests/step3.7-flash-nvfp4-dual-blackwell-2026-05-28
at main ·
Light-Heart-Labs/MMBT-Messy-Model-Bench-Tests
🚀
Performance
github.com
·
15h
·
r/LocalLLaMA
TTS doesn't suck anymore
🧠
LLMs
duarteocarmo.com
·
3d
AI
Engineering
for Developers
⚙️
MLOps
lucavall.in
·
2d
Broadcom BCM68850 and BCM55050 SoCs target Wi-Fi 8 and 50G PON fiber gateways
⏱️
Latency Engineering
cnx-software.com
·
1d
Part II: Use GKE managed DRANET with TPUs and autopilot cluster
🍓
Raspberry Pi Clusters
medium.com
·
3d
PaddlePaddle/PaddleOCR-VL-1.6
🤖
AI
huggingface.co
·
1d
·
r/LocalLLaMA
Scalable, Cost-Efficient AI:
Introducing
Unified Batch
Inference
on DigitalOcean
📡
Edge AI
digitalocean.com
·
2d
NeuroEdge:
Real-Time
Hand Gesture Recognition with High-Density EMG Using Deep Learning at the Edge
📡
Edge AI
arxiv.org
·
16h
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help