Multi-Core By Default
rfleury.com·20h·
🧵Concurrency
N8n vs. Windmill vs. Temporal
blog.arcbjorn.com·22h·
Discuss: Hacker News
🔄Async Runtimes
GoMem is a high-performance memory allocator library for Go
github.com·19h
🧠Memory Allocators
Explicit Lossless Vertex Expanders!
gilkalai.wordpress.com·12h
🔬RaBitQ
Benchmarking LLM Inference on RTX 4090 / RTX 5090 / RTX PRO 6000 #2
reddit.com·3h·
Discuss: r/LocalLLaMA
🏗️LLM Infrastructure
OBCache: Optimal Brain KV Cache Pruning for Efficient Long-Context LLM Inference
arxiv.org·18h
🧠LLM Inference
A sprinkle of Rust - bind, don't rewrite, in the age of MCP servers
medium.com·23h·
Discuss: r/rust
🦀Rust
Item Patterns and Struct Await
noratrieb.dev·16h·
Discuss: Hacker News
🦀Rust
How we built a structured Streamlit Application Framework in Snowflake
about.gitlab.com·22h
🔧Developer tools
MECE — The AI Principle You’ll Never Stop Using After Reading This
pub.towardsai.net·11h
🔍AI Interpretability
Parallelizing Cellular Automata with WebGPU Compute Shaders
vectrx.substack.com·12h·
Discuss: Substack
🏟️Arena Allocators
When Python can't thread: a deep-dive into the GIL's impact
pythonspeed.com·10h·
Discuss: Hacker News
🧵Concurrency
Profiling Your Code: 5 Tips to Significantly Boost Performance
usenix.org·20h
Systems Performance
Patience and Willingness to Be Slow
lesswrong.com·9h
🪄Prompt Engineering
Implementing ZADD If Key Exists
rozumem.xyz·16h·
Discuss: Hacker News
🔒Borrow Checker
Scaling Time-Series Data for AI Models
singlestore.com·6h
🎛️Feed Filtering
VLLM Predicted Outputs
cascadetech.ai·1h·
Discuss: Hacker News
🏗️LLM Infrastructure
Show HN: I built a SaaS in 8 weeks, solo, using our own AI platform
zine.ai·4h·
Discuss: Hacker News
🛠️Solo SaaS Tools