๐Ÿฟ๏ธ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
๐Ÿ“Š Model Serving Economics

GPU Costs, Inference Pricing, Batch Optimization, Resource Efficiency

Optimal Scheduling Algorithms for LLM Inference: Theory and Practice
arxiv.orgยท5h
๐Ÿง LLM Inference
What Would the Aftermath of the AI Bust Look Like?
thediff.coยท18hยท
Discuss: Hacker News
๐Ÿ†•New AI
Import AI 423: Multilingual CLIP; anti-drone tracking; and Huawei kernel design
jack-clark.netยท23h
๐Ÿ†•New AI
Hessian analysis with JAX: a platform-agnostic, high-performance approach
lesswrong.comยท4h
๐Ÿ•ฏ๏ธCandle
The Revolution of Token-Level Rewards
levroai.comยท18hยท
Discuss: Hacker News
๐Ÿ†LLM Benchmarking
AI Clouds Are Flying Blind: The Illusion of Runtime Protection
thenewstack.ioยท14h
๐Ÿ–ฅGPUs
How LLMs See the World
blog.bytebytego.comยท17h
๐Ÿง LLM Inference
How MLB keeps fans connected to the game โ€“ one cache hit at a time
cloud.google.comยท17h
๐Ÿ–ฅGPUs
Simulated Society of 10k AI Agents
theunwindai.comยท17hยท
Discuss: Hacker News
๐Ÿ†•New AI
Towards American Truly Open Models: The ATOM Project
interconnects.aiยท18h
๐Ÿค–AI
OpenAI Is Winning the AI Race, But Losing the Business Game
hackernoon.comยท16h
๐Ÿ–ฅGPUs
One of the first things I was looking for when I got into dspy was to combine it with offline vllm batch inference.
threadreaderapp.comยท10h
๐Ÿ•ฏ๏ธCandle
The future of AI in Finance: Insights from Nubankโ€™s Tech Leaders at Purple MinDS
building.nubank.comยท15h
๐Ÿ†LLM Benchmarking
SAT Requires Exhaustive Search
link.springer.comยท12hยท
Discuss: Hacker News
๐ŸงฎSMT Solvers
How I turned a general-purpose LLM into a professional code optimization expert with one detailed prompt
reddit.comยท23hยท
Discuss: r/programming
๐Ÿช„Prompt Engineering
On SP1โ€™s Precompiles
mycelias.xyzยท5h
โš™๏ธLanguage Runtimes
Snowflake and Databricks vie for the heart of enterprise AI
nordot.appยท23h
๐Ÿ–ฅGPUs
Mithril launches omnicloud for compute and batch inference
mithril.aiยท14hยท
Discuss: Hacker News
๐Ÿ–ฅGPUs
Lessons from Amazon S3 Vector Store and the Nuances of Hybrid Vector Storage
caylent.comยท18hยท
Discuss: Hacker News
๐Ÿ—๏ธSearch Infrastructure
BOOST: Bayesian Optimization with Optimal Kernel and Acquisition Function Selection Technique
arxiv.orgยท5h
๐Ÿ“ŠStatistical Ranking
Loading...Loading more...
AboutBlogChangelogRoadmap