Performance Analysis, Call Graphs, Bottleneck Detection, Instrumentation

Benchmarking LLM Inference on RTX 4090 / RTX 5090 / RTX PRO 6000 #2
reddit.com·17h
Performance
💥 Load Testing: How to Ensure Your Web Application Thrives Under Pressure
dev.to·42m
📊Profiling
AAS: The Metric for Monitoring DB Performance
kylehailey.com·8h
📈Performance Tools
Profiling Your Code: 5 Tips to Significantly Boost Performance
usenix.org·1d
📈Performance Tools
Assuring Agent Safety Evaluations By Analysing Transcripts
lesswrong.com·1d
Effect Inference
Reduce Variability and Optimize Routing Accuracy in Business Central
dmsiworks.com·14h
JIT Optimizations
Debugging Humidity: Lessons from deploying software in the physical world
physical-ai.ghost.io·15h
🛡️Error Boundaries
From Dashboards to Decisions: Building Self Service BI That Scales with AI
dev.to·4h
↔️Bidirectional Sync
Exploring OpenTelemetry Priorities for Mainframes - Insights from Survey Responses
opentelemetry.io·18h
📦Dependency Analysis
How to Eliminate DevOps Toil Using Automation Scripts
devops.com·23h
🗑️Dead Code
Show HN: I made a Google Analytics alternative that's easy and user-friendly
statflows.com·9h
📊Profiling
Together AI's ATLAS adaptive speculator delivers 400% inference speedup by learning from workloads in real-time
venturebeat.com·22h
🗺️Region Inference
InferenceMAX – open-source Inference Frequent Benchmarking
github.com·15h
📡Erlang BEAM
Review: Systems Performance
kuniga.me·1d
📊perf Tools
Creating Real-Time Multimodal AI Pipelines: Scaling File Processing to 50M Daily Uploads
engineering.salesforce.com·11h
🎮Language Ergonomics
From Static Rate Limiting to Adaptive Traffic Management in Airbnb’s Key-Value Store
medium.com·1d
🎯Ring Buffers
Tracking AI product usage without exposing sensitive data
rudderstack.com·9h
🧠Semantic Parsing
TBM 383: Maximizers vs. Focusers
substackcdn.com·20h
📡Async Channels
Can AI Co-Design Distributed Systems? Scaling from 1 GPU to 1k
harvard-edge.github.io·13h
📋Task Queues