Local LLM Server, Model Management, API Server, Inference Engine

Tiny GenBI: Lightweight Agent for business analysis
github.com·9h·
Discuss: Hacker News
🔍Paradedb
Run LLMs Locally
ikangai.com·13h·
Discuss: Hacker News
🏗️LLM Infrastructure
Show HN: Kumi – a portable, declarative, functional core for business logic
kumi-play-web.fly.dev·12h·
Discuss: Hacker News
💻Programming languages
What's the stack for going from a fine-tune on vLLM to a simple, paid public API?
reddit.com·7h·
Discuss: r/LocalLLaMA
🏗️LLM Infrastructure
An introduction to program synthesis (Part II) - Automatically generating features for machine learning
mchav.github.io·21h·
Discuss: r/programming
🎯Qdrant
Build intelligent agents with every leading model on Databricks
databricks.com·15h
🆕New AI
Co-Optimizing GPU Architecture And SW To Enhance Edge Inference Performance (NVIDIA)
semiengineering.com·13h
🏗️LLM Infrastructure
The Complexity Cliff: Why Reasoning Models Work Right Up Until They Don't
rewire.it·8h·
Discuss: Hacker News
🏗️LLM Infrastructure
The InsightOS architecture
feedly.com·18h
🏝️Islands Architecture
Building Real-Time ML Feature Pipelines with Streaming SQL
timeplus.com·14h·
Discuss: Hacker News
🧠Inference Serving
LazyLLM, Easiest and laziest way for building multi-agent LLMs applications
github.com·8h·
Discuss: Hacker News
🏗️LLM Infrastructure
An ARENA 6.0 Capstone: Model Organism of Encoded Reasoning
lesswrong.com·8h
🔤Tokenization
OpenLoRa: Validating LoRa Implementations Through an Open-Sourced Framework
usenix.org·22h·
Discuss: Hacker News
🌐Pingora
Magentic Marketplace: an open-source simulation environment for studying agentic markets
microsoft.com·14h
💹Platform Economics
Beyond Basic RAG: AI Agents for Context-Aware Responses
thenewstack.io·15h
🔄LLM RAG Pipelines
How Databricks Implemented Intelligent Kubernetes Load Balancing
blog.bytebytego.com·15h
💎Durable Objects
Text to SQL: Local, Secure, and Smarter
exasol.com·20h·
Discuss: Hacker News
📝Prepared Statements
AI Agent Orchestration Frameworks
blog.n8n.io·22h·
Discuss: Hacker News
🔧Developer tools
LDBT instead of DBTL: combining machine learning and rapid cell-free testing
nature.com·16h
🏗️LLM Infrastructure
Flag this post
Building a highly-available web service without a database
screenshotbot.io·23h·
Discuss: r/programming
🗳️Raft Consensus