๐Ÿฟ๏ธ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
๐Ÿง  Inference Serving

Request Batching, Model Loading, Throughput Optimization, Latency Management

MIGHT: Powerful new algorithm advances reliability of AI with applications in medical diagnostics
medicalxpress.comยท2h
๐Ÿ“ŠVector Databases
bitdrift turns 2: a retrospective
blog.bitdrift.ioยท21h
๐Ÿ’งLitestream
Reranking in Mosaic AI Vector Search for Faster, Smarter Retrieval in RAG Agents
databricks.comยท2h
๐Ÿ†Ranking
OpenAI $500B valuation ๐Ÿ’ฐ, Palantir mafia ๐Ÿ’ผ, building a search engine ๐Ÿ‘จโ€๐Ÿ’ป
tldr.techยท21h
๐Ÿš€Startups
We open-sourced Memori: A memory engine for AI agents
reddit.comยท6hยท
Discuss: r/LocalLLaMA
๐Ÿ’พPersistence Strategies
TokenRec: Learning to Tokenize ID for LLM-based Generative Recommendation
arxiv.orgยท17h
๐Ÿง LLM Inference
Nonparametric learning of stochastic differential equations from sparse and noisy data
arxiv.orgยท17h
๐Ÿง LLM Inference
Retrieval-augmented reasoning with lean language models
arxiv.orgยท17h
๐Ÿ”„LLM RAG Pipelines
The power of two random choices
brooker.co.zaยท19hยท
Discuss: Hacker News
๐ŸŒDistributed systems
Benchmarking Frontends in 2025
reddit.comยท11hยท
Discuss: r/programming
๐Ÿš€Web Performance
A Comprehensive Perspective on Explainable AI across the Machine Learning Workflow
arxiv.orgยท17h
๐Ÿ”AI Interpretability
Prophet Arena: A Live Benchmark for Predictive Intelligence
prophetarena.coยท11hยท
Discuss: Hacker News
๐Ÿ†LLM Benchmarking
MobQA: A Benchmark Dataset for Semantic Understanding of Human Mobility Data through Question Answering
arxiv.orgยท17h
๐Ÿง LLM Inference
AlphaEarth Foundations: a universal embedding for Earth observation data
newsletter.caffeinatedengineer.devยท14hยท
Discuss: Hacker News
๐Ÿ“ŠEmbeddings
Models are smart enough, your process isn't
sibylline.devยท7hยท
Discuss: Hacker News
๐Ÿช„Prompt Engineering
CTRL Your Shift: Clustered Transfer Residual Learning for Many Small Datasets
arxiv.orgยท17h
๐Ÿ“ŠVector Databases
The Impact of Large Language Models (LLMs) on Code Review Process
arxiv.orgยท17h
๐Ÿช„Prompt Engineering
Fine-Grained VLM Fine-tuning via Latent Hierarchical Adapter Learning
arxiv.orgยท17h
๐Ÿง LLM Inference
Handling PII in customer-facing AI chatbots: mask before sending to LLM
hoverbot.aiยท16hยท
Discuss: Hacker News
๐Ÿ›ก๏ธAI Security
gpt-oss-120b & gpt-oss-20b Model Card
arxiv.orgยท17h
๐Ÿ“ฑEdge AI Optimization
Loading...Loading more...
AboutBlogChangelogRoadmap