Model Optimization, Inference Engines, LLM Quantization, Privacy-focused Deployments

Private LLM Inference: Democratizing AI with Ciphertext Computations
dev.to·1d·
Discuss: DEV
💻Local LLMs
I built an LLM from Scratch in Rust (Just ndarray and rand)
github.com·15h·
Discuss: r/rust
💻Local LLMs
AI Unleashed: Secure LLM Inference for Everyone
dev.to·15h·
Discuss: DEV
💻Local LLMs
Unlocking LLMs: Secure Inference for the Rest of Us
dev.to·6h·
Discuss: DEV
💻Local LLMs
Personas within Parameters: Fine-Tuning Small Language Models with Low-Rank Adapters to Mimic User Behaviors
arxiv.org·3h
💻Local LLMs
Disaggregated Inference at Scale with PyTorch and vLLM
pytorch.org·1d·
Discuss: Hacker News
🏗️AI Infrastructure
The Data Backbone of LLM Systems
infoq.com·3d·
Discuss: Lobsters
🏗️AI Infrastructure
AI for Everyone: Secure Language Models Without the Hardware Hype
dev.to·1d·
Discuss: DEV
💻Local LLMs
Unlocking LLMs: Secure, Efficient Inference for Everyone
dev.to·1d·
Discuss: DEV
💻Local LLMs
Demystifying LLM Tuning: XAI-Powered Optimization Unveiled by Arvind Sundararajan
dev.to·5h·
Discuss: DEV
💻Local LLMs
VaultGemma: The world's most capable differentially private LLM
research.google·2d
💻Local LLMs
So You Want to Host Your Own LLM? Don't
mahdiyusuf.com·5h
🏢Self-hosting
How I became a machine learning practitioner (2019)
blog.gregbrockman.com·4h·
Discuss: Hacker News
🏗️AI Infrastructure
Democratizing AI: Structured Deep RL for Everyone
dev.to·11h·
Discuss: DEV
🏗️AI Infrastructure
LLM Rerankers for RAG: A Practical Guide
fin.ai·9h·
Discuss: Hacker News
🏗️AI Infrastructure
The mistake I made with my first AI Agent (and the simpler fix)
reddit.com·7h·
Discuss: r/artificial
🤖AI agents
(2/4) LLM: Data, Transformers, and Relentless Compute
dev.to·1d·
Discuss: DEV
💻Local LLMs
LAVa: Layer-wise KV Cache Eviction with Dynamic Budget Allocation
arxiv.org·3h
💻Local LLMs
Language Models Pack Billions of Concepts into 12,000 Dimensions
nickyoder.com·3h·
Discuss: Hacker News
🎯Vector Databases
Differentially Private Decentralized Dataset Synthesis Through Randomized Mixing with Correlated Noise
arxiv.org·3h
🤝Federated Learning