Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
💰 Compute Costs
Specific
GPU cost, training cost, inference cost, FLOP pricing, cloud spend
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
198936
posts in
25.1
ms
Unraveling
GPU Inference Costs for
Fine-tuned
Open-source Models V/S Closed Platforms
📊
Model Serving Economics
mlops.community
·
1d
Autodata
: an automatic data
scientist
to create high-quality data (5 minute read)
⚙️
AI Automation
facebookresearch.github.io
·
3d
STOP: Structured On-Policy
Pruning
of Long-Form Reasoning in Low-Data
Regimes
🤖
LLM
arxiv.org
·
19h
Cheaper Cloud Strategy: Why Cost
Reduction
Without Architecture Changes
Fails
🏛️
Technical Architecture
rack2cloud.com
·
1d
·
DEV
Investment notes:
Deci
US$
9.1m
Seed
🤖
AI News
squarepeg.vc
·
1d
Budgeted
Attention Allocation:
Cost-Conditioned
Compute Control for Efficient Transformers
⚡
LLM Optimization
arxiv.org
·
6d
AESOP
: Adversarial Execution-path Selection to
Overload
Deep Learning Pipelines
🛡️
AI Security
arxiv.org
·
1d
Towards Generation-Efficient
Uncertainty
Estimation
in Large Language Models
🤖
LLM
arxiv.org
·
6d
Dynamic
Execution
Commitment
of Vision-Language-Action Models
⚙️
LLMOps
arxiv.org
·
1d
On
Variance
Reduction in Learning Mean
Flows
⚡
LLM Optimization
arxiv.org
·
2d
StreamPhy
: Streaming Inference of
High-Dimensional
Physical Dynamics via State Space Models
🌊
Stream Processing
arxiv.org
·
3d
Uncertainty-Aware Token
Importance
Estimation in
Spiking
Transformers
🔌
Neural Interfaces
arxiv.org
·
2d
Tyche
: One Step Flow for Efficient
Probabilistic
Weather Forecasting
⚡
LLM Optimization
arxiv.org
·
3d
ConQuR
: Corner Aligned Activation Quantization via Optimized
Rotations
for LLMs
⚡
LLM Optimization
arxiv.org
·
2d
GRC
:
Unifying
Reasoning-Driven Generation, Retrieval and Compression
🤖
LLM
arxiv.org
·
1d
Scene-Adaptive
Continual
Learning for
CSI-based
Human Activity Recognition with Mixture of Experts
👁️
Perceptual Hashing
arxiv.org
·
6d
AAAC
: Activation-Aware Adaptive
Codebooks
for 4-bit LLM Weight Quantization
⚡
LLM Optimization
arxiv.org
·
2d
Efficient and Adaptive Human
Activity
Recognition via LLM
Backbones
⚡
LLM Optimization
arxiv.org
·
1d
Selective Rollout: Mid-Trajectory
Termination
for Multi-Sample Agent
RL
🌍
World Models
arxiv.org
·
6d
OmniRefine
: Alignment-Aware Cooperative Compression for Efficient
Omnimodal
Large Language Models
⚡
LLM Optimization
arxiv.org
·
1d
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help