Can AI Co-Design Distributed Systems? Scaling from 1 GPU to 1k
harvard-edge.github.io·1h·
Discuss: Hacker News
🌐Distributed systems
Fast and robust drift correction for single-molecule localization microscopy
nature.com·12h
🕯️Candle
Benchmarking LLM Inference on RTX 4090 / RTX 5090 / RTX PRO 6000 #2
reddit.com·5h·
Discuss: r/LocalLLaMA
🏗️LLM Infrastructure
From Text to Token: How Tokenization Pipelines Work
paradedb.com·23h
🔤Tokenization
I Built the Perfect Workflow and attracted some friends in the process
graemefawcett.ca·19m·
Discuss: Hacker News
🪄Prompt Engineering
LLM-Based AI Agent That Automates The Transistor Sizing Process (Univ. of Edinburgh)
semiengineering.com·2h
🆕New AI
finetuning Medium or Small language model for factual and memorizing data.
reddit.com·13h·
Discuss: r/LocalLLaMA
🔄LLM RAG Pipelines
Windows 10 support ends in days. Here’s how to switch to Windows 11
nordot.app·6h
💾Persistence Strategies
Truly distributed and 5x more resilient - CockroachDB vs Oracle GDD
cockroachlabs.com·23h
🏗️FoundationDB
QUIC! Jump to User Space!
hackaday.com·7h
QUIC Protocol
Explicit Lossless Vertex Expanders!
gilkalai.wordpress.com·13h
🔬RaBitQ
Operable Software
ferd.ca·10h·
Discuss: Hacker News
🌐Distributed systems
How View Caching in Rails Works (2020)
honeybadger.io·9h·
Discuss: Hacker News
💾Prompt Caching
Looking at my Arduino
boswell.bearblog.dev·6h
🖥️Hardware Architecture
YouTube gets ~5% CTR lift on Shorts by replacing embedding tables with Semantic IDs
shaped.ai·23h
📊Feed Optimization
Evaluating Gemini 2.5 Deep Think's math capabilities
epoch.ai·9h·
Discuss: Hacker News
🏆LLM Benchmarking
Multi-Core By Default
rfleury.com·22h·
🧵Concurrency
InferenceMAX – open-source Inference Frequent Benchmarking
github.com·3h·
Discuss: Hacker News
🏗️LLM Infrastructure
How we built a structured Streamlit Application Framework in Snowflake
about.gitlab.com·23h
🔧Developer tools
How Do SSDs Work?
extremetech.com·11h·
Discuss: Hacker News
⚙️Mechanical Sympathy