How fast can an LLM go?
fergusfinn.com·19h·Discuss: Hacker News
🏗️LLM Infrastructure
Building "RAG from Scratch". A local, educational repo to really understand Retrieval-Augmented Generation (feedback welcome)
reddit.com·18h·Discuss: r/LocalLLaMA
🎯Qdrant
Raising the Bar on ML Model Deployment Safety
uber.com·16h
🏗️LLM Infrastructure
What Is an AI PaaS? A Guide to the Future of AI Development
thenewstack.io·10h
🏗️LLM Infrastructure
Show HN: I built a lightweight AI tool to analyze visitor behavior
getallinsights.com·15h·Discuss: Hacker News
📊Feed Optimization
Vibe Check: I Canceled Two AI Max Plans for Factory’s Coding Agent Droid
kill-the-newsletter.com·13h
🛡️AI Security
Opportunistically Parallel Lambda Calculus
dl.acm.org·8h·Discuss: Hacker News
💻Programming languages
Quadric: Revolutionizing Edge AI
semiwiki.com·13h
📱Edge AI Optimization
Tencent/WeKnora
github.com·4h
🔎Meilisearch
Self-evolving edge AI enables real-time learning and forecasting in small devices
techxplore.com·10h
📱Edge AI Optimization
Pseudo-Knowledge Graphs for Better RAG
pub.towardsai.net·14h
🔄LLM RAG Pipelines
How We Saved 70% of CPU and 60% of Memory in Refinery’s Go Code, No Rust Required.
honeycomb.io·5h·Discuss: r/programming
🔬Rust Profiling
Show HN: Fast-posit, sw implementation of posit arithmetic in Rust
github.com·12h·Discuss: Hacker News
🔎Tantivy
Advances In Formal Verification Technology
semiengineering.com·23h
🧮SMT Solvers
Scaling Embeddings with Feast and KubeRay
feast.dev·14h·Discuss: Hacker News
🏗️LLM Infrastructure
How to Build Digital Twins for Operational Efficiency
databricks.com·15h
🏗️LLM Infrastructure
Designing Smarter Health Checks for Pomerium
pomerium.com·13h·Discuss: Hacker News
💎Durable Objects
The internet’s dirty secret: streaming is killing the planet
thehill.com·18h
🌍Climate
Toward provably private insights into AI use
research.google·19h
🏗️LLM Infrastructure
Discovery of obesity genes through cross-ancestry analysis
nature.com·18h
📇Vector Indexing