Skip to main content
Scour
Discover
Docs
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Infra
ποΈ AI Infra
ML infrastructure, model serving, inference, AI platform
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
299
posts in
16.2
ms
π
MCP
biorxiv.org
Β·
16h
16 hours ago
Ambiguity-Aware Multi-Stage Cell-Type Annotation for Spatial Transcriptomics
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Ambiguity-Aware Multi-Stage Cell-Type Annotation for Spatial Transcriptomics
π
Observability
TNW | Artificial-Intelligence
Β·
2d
2 days ago
Qualcomm lands Meta as first named customer for its Dragonfly
data
centre chips
Covered byΒ
tldr.tech
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Qualcomm lands Meta as first named customer for its Dragonfly data centre chips
π
RAG
medium.com
Β·
4d
4 days ago
Why RAG Systems Fail Even When Everything Looks Correct
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Why RAG Systems Fail Even When Everything Looks Correct
βοΈ
Prompt Engineering
cmart's blog
Β·
1d
1 day ago
Inference
Cards
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Inference Cards
π
MLOps
digitalocean.com
Β·
8h
8 hours ago
Why
Serverless
Inference
Consistency Varies on the Same
Model
CoversΒ
4Β stories
See all stories this covers
Β includingΒ
vllm-project/vllm
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Why Serverless Inference Consistency Varies on the Same Model
ποΈ
Vector Databases
docs.vultr.com
Β·
3d
3 days ago
Deploying Qdrant Open-Source
Vector
Database
for
AI
Applications on Ubuntu 24.04
CoversΒ
2Β stories
See all stories this covers
Β includingΒ
Qdrant - Vector Database
Discussed on
DEV
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Deploying Qdrant Open-Source Vector Database for AI Applications on Ubuntu 24.04
π§
LLMs
medium.com
Β·
2d
2 days ago
Deep Learning
Inference
: PyTorch, ONNX, and
TensorRT
Explained
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Deep Learning Inference: PyTorch, ONNX, and TensorRT Explained
π
RAG
exa.ai
Β·
11h
11 hours ago
AI
Search
Engine
Exa Raises $250M Series C
Covered byΒ
7Β sources
See all sources covering this story
Β includingΒ
latent.space
,
imjuya.github.io
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for AI Search Engine Exa Raises $250M Series C
π§
LLMs
Modal
Β·
2d
2 days ago
Achieve state-of-the-art
inference
latencies with speculative decoding
CoversΒ
DFlash: Block Diffusion for Flash Speculative Decoding
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Achieve state-of-the-art inference latencies with speculative decoding
ποΈ
Vector Databases
Nazar Boyko
Β·
6d
6 days ago
Vector
Databases
Compared: pgvector, Qdrant, Pinecone, Weaviate
Discussed on
DEV
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Vector Databases Compared: pgvector, Qdrant, Pinecone, Weaviate
π§
LLMs
Data For Science
Β·
8h
8 hours ago
Book Review: LLMs in Production: From Language
Models
to Successful Products
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Book Review: LLMs in Production: From Language Models to Successful Products
π§
LLMs
Hugging Face
Β·
2d
2 days ago
Qwen-AgentWorld-35B-A3B for Coding?
Discussed on
r/LocalLLaMA
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Qwen-AgentWorld-35B-A3B for Coding?
ποΈ
Vector Databases
arXiv
Β·
17h
17 hours ago
What Survives When You Compress a Recursive Reasoner for the Edge?
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for What Survives When You Compress a Recursive Reasoner for the Edge?
π
LLM Evaluation
NVIDIA Technical Blog
Β·
3d
3 days ago
Boost
Inference
Performance up to 15x on NVIDIA Blackwell Using DFlash Speculative Decoding
CoversΒ
4Β stories
See all stories this covers
Β includingΒ
NVIDIA Blackwell Architecture
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Boost Inference Performance up to 15x on NVIDIA Blackwell Using DFlash Speculative Decoding
π§
LLMs
4sysops
Β·
2d
2 days ago
OpenAI and Broadcom reveal JalapeΓ±o chip to optimize large language
model
inference
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for OpenAI and Broadcom reveal JalapeΓ±o chip to optimize large language model inference
βοΈ
Backend Engineering
Hacker News
Β·
4d
4 days ago
Ask HN: What are some rock solid open source
vector
databases
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Ask HN: What are some rock solid open source vector databases
π€
AI Agents
blocksandfiles
Β·
2d
2 days ago
DDN launches faster array HW and KV Cache SW for
AI
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for DDN launches faster array HW and KV Cache SW for AI
π
MLOps
mayursurani.medium.com
Β·
4d
4 days ago
MLflow 101: Why
MLOps
Matters and How MLflow Solves the
Model
Deployment
Crisis
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for MLflow 101: Why MLOps Matters and How MLflow Solves the Model Deployment Crisis
ποΈ
Vector Databases
zilliz.com
Β·
1d
1 day ago
Show HN:
Vectordb
benchmark
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Vectordb benchmark
π
MLOps
intelligence-per-watt.ai
Β·
12h
12 hours ago
Intelligence per Watt: A Unified Metric for the
AI
Era
CoversΒ
OpenJarvis: Personal AI, on Personal Devices
Covered byΒ
GitHub
,
hazyresearch.stanford.edu
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Intelligence per Watt: A Unified Metric for the AI Era
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report