Benchmarking LLM Inference on RTX 4090 / RTX 5090 / RTX PRO 6000 #2
reddit.comยท9hยท
Discuss: r/LocalLLaMA
๐Ÿ—๏ธLLM Infrastructure
Assuring Agent Safety Evaluations By Analysing Transcripts
lesswrong.comยท17h
๐Ÿ†LLM Benchmarking
AAS: The Metric for Monitoring DB Performance
kylehailey.comยท17mยท
Discuss: Hacker News
๐Ÿ“ŠDatabase Profiling
OBCache: Optimal Brain KV Cache Pruning for Efficient Long-Context LLM Inference
arxiv.orgยท23h
๐Ÿง LLM Inference
Building a Scalable QA Automation Strategy: The 90-Day Roadmap
codemeetscapital.bearblog.devยท10h
๐Ÿ‘จโ€๐Ÿ’ปSoftware development practices
Supercharge your Enterprise BI: How to approach your migration to AI/BI
databricks.comยท6h
๐Ÿ—๏ธInfrastructure Economics
MECE โ€” The AI Principle Youโ€™ll Never Stop Using After Reading This
pub.towardsai.netยท16h
๐Ÿ”AI Interpretability
simplicity โ€ข Pragmatic Dave Thomas & Sarah Taraporewalla
buzzsprout.comยท9hยท
Discuss: r/programming
โšกDeveloper Experience
InferenceMAX โ€“ open-source Inference Frequent Benchmarking
github.comยท7hยท
Discuss: Hacker News
๐Ÿ—๏ธLLM Infrastructure
Tracking AI product usage without exposing sensitive data
rudderstack.comยท1hยท
Discuss: r/programming
๐Ÿ“ŠFeed Optimization
When Python can't thread: a deep-dive into the GIL's impact
pythonspeed.comยท15hยท
Discuss: Hacker News
๐ŸงตConcurrency
Operable Software
ferd.caยท13hยท
Discuss: Hacker News
๐ŸŒDistributed systems
Can AI Co-Design Distributed Systems? Scaling from 1 GPU to 1k
harvard-edge.github.ioยท5hยท
Discuss: Hacker News
๐ŸŒDistributed systems
Let's Write a Macro in Rust
hackeryarn.comยท11hยท
Discuss: Hacker News
๐ŸŽญRust Macros
Scaling Time-Series Data for AI Models
singlestore.comยท12h
๐ŸŽ›๏ธFeed Filtering
VLLM Predicted Outputs
cascadetech.aiยท6hยท
Discuss: Hacker News
๐Ÿ—๏ธLLM Infrastructure
How different AI engines generate and cite answers
searchengineland.comยท15h
๐Ÿ“ŠFeed Optimization
Show HN: I made a Google Analytics alternative that's easy and user-friendly
statflows.comยท1hยท
Discuss: Hacker News
๐Ÿš€Web Performance
From Toil to Empowerment: Building Self-Service Ingress with GitOps
usenix.orgยท23h
๐ŸŒDistributed systems