Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🏗️ AI Infrastructure
Model Serving, GPU Clusters, Inference Optimization, MLOps
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
171144
posts in
22.6
ms
AEG
: A
Baremetal
Framework for AI Acceleration via Direct Hardware Access in Heterogeneous Accelerators
⚡
Hardware Acceleration
arxiv.org
·
1d
Collaborative
AI Systems: Human-AI
Teaming
Workflows
🤖
AI Coding Tools
kdnuggets.com
·
16h
SambaNova
and Intel expand partnership with inference architecture to support agentic AI
workloads
🤖
AI Inference
datacenterdynamics.com
·
6d
Stop
benchmarking
inference
providers
, a guide to easy evaluation
🤖
AI Inference
huggingface.co
·
15h
·
r/LocalLLaMA
Artificial Intelligence Lab: A
Practical
Roadmap
to Modern AI Systems
🤖
AI Coding Tools
medium.com
·
2d
A developer’s guide to
architecting
reliable
GPU infrastructure at scale
⚡
Hardware Acceleration
cloud.google.com
·
5d
AI Infrastructure Lock-In: Why
PyTorch
is the Only
Abstraction
Layer That Matters
🔥
PyTorch
youtube.com
·
1d
From AGI to LLMs and hallucinations:
unpacking
confusing
AI terms
📱
Edge AI
digitaltoday.co.kr
·
1d
Blink: CPU-Free LLM Inference by
Delegating
the Serving Stack to GPU and
SmartNIC
💻
Local LLMs
arxiv.org
·
5d
Technology solutions targeting the performance of gen-AI inference in
resource
constrained
platforms
🤖
AI Inference
arxiv.org
·
1d
Fast
Heterogeneous
Serving: Scalable Mixed-Scale LLM Allocation for
SLO-Constrained
Inference
💻
Local LLMs
arxiv.org
·
5d
MoEITS
: A Green AI approach for
simplifying
MoE-LLMs
📱
Edge AI
arxiv.org
·
1d
Tessera
: Unlocking Heterogeneous GPUs through Kernel-Granularity
Disaggregation
⚡
Hardware Acceleration
arxiv.org
·
1d
Networking-Aware
Energy
Efficiency
in Agentic AI Inference: A Survey
🤖
AI Inference
arxiv.org
·
5d
LABBench2
: An Improved Benchmark for AI Systems Performing
Biology
Research
📱
Edge AI
arxiv.org
·
1d
Pioneer
Agent:
Continual
Improvement of Small Language Models in Production
📱
Edge AI
arxiv.org
·
1d
ATANT
: An Evaluation Framework for AI
Continuity
🤖
AI Inference
arxiv.org
·
6d
Sustainability-Constrained
Workload
Orchestration
for Sovereign AI Infrastructure: A Joint Compute-Network Optimization Framework
🌱
Sustainable Computing
arxiv.org
·
1d
Measurement of Generative AI
Workload
Power
Profiles
for Whole-Facility Data Center Infrastructure Planning
🌱
Sustainable Computing
arxiv.org
·
6d
Making Room for AI: Multi-GPU Molecular Dynamics with Deep
Potentials
in
GROMACS
🧬
Computational Biology
arxiv.org
·
6d
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help