Building a 60,000 RPS Time-Series Data Ingestion Pipeline in Go
tsharma.bearblog.dev·1d
📮Persistent Queues
RAG-Fusion Multimodal: The Theory Behind Local Document Intelligence
pub.towardsai.net·2d
🚀Tokenizer Performance
Rustchain: Enterprise AI Agent Framework with Universal Workflow Transpilation (LangChain → GitHub Actions, Airflow, K8s)
reddit.com·1d·
Discuss: r/rust
🚂Cranelift IR
Building Language Tech for Meghalaya: Lessons from Tokenizing Khasi and Garo with Modern LLMs
dev.to·1d·
Discuss: DEV
Tokenizer Benchmarks
Building Search for this Site – Search on a static site
alexleighton.com·1d·
Discuss: Hacker News
📋Tablegen
Benchmarking Document Parsing (and What Actually Matters)
unstructured.io·16h
🧠Semantic Parsing
GSoC 2025: Improving Core Clang-Doc Functionality
blog.llvm.org·16h
📋Tablegen
PILOT: Steering Synthetic Data Generation with Psychological & Linguistic Output Targeting
arxiv.org·12h
💬Interactive REPLs
Concurrent Linguistic Error Detection (CLED): a New Methodology for Error Detection in Large Language Models
arxiv.org·5d
🧪Parser Testing
GitHub - vshakitskiy/how-to-otp: Learn how to work with OTP in Gleam!
github.com·1d
Gleam
Thinking, Searching, and Acting
interconnects.ai·1h
🎭Program Synthesis
ML-based profiling of data skew and bottlenecks on Databricks
dev.to·7h·
Discuss: DEV
🔮Speculative Execution
LLM-Deflate: Extracting LLMs into Datasets
scalarlm.com·2d·
Discuss: Hacker News
🪜Recursive Descent
Achieving TB-Level Aggregate Bandwidth: How JuiceFS Optimized Distributed Cache Network
dev.to·6h·
Discuss: DEV
🌍HTTP Servers
Web Developer Travis McCracken on Why I Use Rust for Stateless Microservices
dev.to·4h·
Discuss: DEV
🔧API Design
Vibe-coding and open-source: 286k LoC, 2 months
github.com·2d·
Discuss: Hacker News
📋Tablegen
Tracking prompt evolution for RAG systems - anyone else doing this?
github.com·18h·
Discuss: r/LocalLLaMA
Live Programming