🐿️ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🧠 Inference Serving

Request Batching, Model Loading, Throughput Optimization, Latency Management

Baseten raises $150M Series D for inference infra but where’s the real bottleneck?
reddit.com·14h·
Discuss: r/LocalLLaMA
📊Model Serving Economics
EmbeddingGemma Model Card
ai.google.dev·4h·
Discuss: Hacker News
📊Embeddings
EP179: Kubernetes Explained
blog.bytebytego.com·16h
💾Persistence Strategies
Everyone is talking about this new OpenAI paper.
threadreaderapp.com·10h
🆕New AI
Resources, Laziness, and Continuation-Passing Style
journal.infinitenegativeutility.com·2h·
Discuss: Lobsters, Hacker News
💫IO_uring
60-Lesson Course Curriculum : Hands-on System Design with Java Spring Boot
javatsc.substack.com·5h·
Discuss: r/programming
🌐Distributed systems
How We Built Our lakeFS Iceberg Catalog
lakefs.io·1h·
Discuss: Hacker News
🌳LSM Trees
Wrote an in-depth blog on scaling modern transformers with n-D parallelism
jaxformer.com·5h·
Discuss: Hacker News
🕯️Candle
BlazingMQ: A modern, high-performance open message queuing system
github.com·8h·
Discuss: Hacker News
🔄Cache Coherence
Covariant spatio-temporal receptive fields for spiking neural networks
nature.com·6h·
Discuss: Hacker News
⚡Hardware Acceleration
Why Most RAG Pipelines Fail (And How to Fix Them)
pub.towardsai.net·19h
🔄LLM RAG Pipelines
Show HN: Reverse vs. Vectorized Forward Ad: A Performance Exploration in C
raph5.github.io·21h·
Discuss: Hacker News
⚡SIMD Optimization
Vibe Coding Through the Berghain Challenge
nibzard.com·17h·
Discuss: Hacker News
💳Content Monetization
[Level 1] Building Personalized Text Summarization - Following up on Personal Chatbot Success
colab.research.google.com·16h·
Discuss: r/LocalLLaMA
👨‍💻AI Coding
AI could one day replace tutors, but its reliability still lags
phys.org·15h
🏆LLM Benchmarking
Last Week on My Mac: Coming soon to your Mac’s neural engine
eclecticlight.co·59m
📱New tech trends
Relaunching Yakread: an algorithmic reading app
biffweb.com·19h·
Discuss: Hacker News
📰RSS Reading Practices
GPT-5 Thinking in ChatGPT is shockingly good at search and demonstrates the potential of combining tool calling with chain-of-thought reasoning (Simon Willison/...
techmeme.com·9h
💳Content Monetization
The State of AI Browser Agents in 2025
fillapp.ai·23h·
Discuss: Hacker News
🆕New AI
🔗 Understanding stack traces in Elixir
yellowduck.be·18h
🔬Rust Profiling
Loading...Loading more...
AboutBlogChangelogRoadmap