🤖 AI
artificial intelligence, machine learning, neural networks, LLM
Scoured 232 posts in 16.4 ms
Exploring the Efficiency of 3D-Stacked AI Chip Architecture for LLM Inference with Voxel
⚡ Hardware Acceleration
arxiv.org · 6d
Automated deep learning by recurrent hyperparameter optimization
📐 Vector Search
nature.com · 1d
AI academic search needs better frameworks for understanding and evaluation. These three librarian projects are a start
🎯 BM25
aarontay.substack.com · 1d · Substack
My New Ebook (Free Download): Quantization for Modern AI Systems
📉 Embeddings Optimization
pawankjha.substack.com · 3d · Substack
Category Theory for Tiny ML in Rust
📐 Algorithms
hghalebi.github.io · 1d · Hacker News
AI 101: What’s So Magical About Embeddings?
🔢 Vector Databases
turingpost.com · 6d
A decoder-only foundation model for time-series forecasting
📉 Embeddings Optimization
research.google · 6d
How AI can streamline your security testing
🎯 Speculative Execution
redcanary.com · 6d
Token Arena: A Continuous Benchmark Unifying Energy and Cognition in AI Inference
♟️ Chess Engines
arxiv.org · 2d
AGoQ: Activation and Gradient Quantization for Memory-Efficient Distributed Training of LLMs
📉 Embeddings Optimization
arxiv.org · 2d
Cloud Is Closer Than It Appears: Revisiting the Tradeoffs of Distributed Real-Time Inference
⚡ Low-Latency Systems
arxiv.org · 2d
LLM-Emu: Native Runtime Emulation of LLM Inference via Profile-Driven Sampling
⚡ Vectorized Execution
arxiv.org · 2d
AAFLOW: Scalable Patterns for Agentic AI Workflows
🌐 Distributed Systems
arxiv.org · 1d
Focus Session: Autonomous Systems Dependability in the era of AI: Design Challenges in Safety, Security, Reliability and Certification
🌐 Distributed Systems
arxiv.org · 5d
SAGA: Workflow-Atomic Scheduling for AI Agent Inference on GPU Clusters
🖥️ GPU Computing
arxiv.org · 2d
Heterogeneous Model Fusion for Privacy-Aware Multi-Camera Surveillance via Synthetic Domain Adaptation
📉 Embeddings Optimization
arxiv.org · 1d
AutoSP: Unlocking Long-Context LLM Training Via Compiler-Based Sequence Parallelism
🔧 Compilers
arxiv.org · 5d · Hacker News
Stochastic Sparse Attention for Memory-Bound Inference
⚡ Vectorized Execution
arxiv.org · 1d
SplitZip: Ultra Fast Lossless KV Compression for Disaggregated LLM Serving
⚡ Low-Latency Systems
arxiv.org · 1d
AI Inference as Relocatable Electricity Demand: A Latency-Constrained Energy-Geography Framework
♟️ Chess Engines
arxiv.org · 5d