Model Optimization, Inference Engines, LLM Quantization, Privacy-focused Deployments

Feeds to Scour
SubscribedAll
Scoured 4998 posts in 358.6 ms
Introducing the XLab AI Security Guide
lesswrong.com·9h
🛡️Computer Security
Preview
Report Post
Federation Over Embeddings: Let AI Agents Query Data Where It Lives
gnanaguru.com·1h·
Discuss: Hacker News
🏗️AI Infrastructure
Preview
Report Post
From Online Profile Paranoia to AI Complacency: Have We Really Stopped Caring About Privacy?
metamood.ai·2d
💻Local LLMs
Preview
Report Post
Optimizing LLM inference on Amazon SageMaker AI with BentoML’s LLM- Optimizer
aws.amazon.com·3d
🏗️AI Infrastructure
Preview
Report Post
No, Small Models Are Not the "Budget Option" (English)
mostlylucid.net·1h
💻Local LLMs
Preview
Report Post
I Built an Open Source Health AI Agent Without a Vector DB (Laravel 12, React, Typescript + InteriaJS + Gemini)
dev.to·6h·
Discuss: DEV
🤖AI Coding Tools
Preview
Report Post
is this legit? Supposedly LangVAE straps a VAE + compression algorithm onto any LLM image, reduces resource requirements by up to...
arxiv.org·3d·
Discuss: r/LocalLLaMA
💻Local LLMs
Preview
Report Post
Yann LeCun’s VL-JEPA: The breakthrough that gives AI a "Mind's Eye" (instead of just a mouth).
hisohan.substack.com·8h·
Discuss: Substack
📱Edge AI
Preview
Report Post
A Linux User’s Approach to Local, Privacy-Respecting Image Editing using Local AI Model
reddit.com·2d·
Discuss: r/linux
💻Local LLMs
Preview
Report Post
wwes4/AI_Accel_1.5x: AI acceleration framework for ~1.5x speedups in mid-sized models via tension-based pruning. Built utilizing xAI's Grok.
github.com·1d·
Discuss: Hacker News
🔥PyTorch
Preview
Report Post
Show HN: Why is ML inference still so ad-hoc in practice?
news.ycombinator.com·1d·
Discuss: Hacker News
🚀MLOps
Preview
Report Post
AIAuditTrack: A Framework for AI Security system
arxiv.org·2d
🏗️AI Infrastructure
Preview
Report Post
AI Infrastructure Basics: How MCP Works
newsletter.systemdesign.one
·1d·
Discuss: r/programming
🏗️AI Infrastructure
Preview
Report Post
Thread by @theresanaiforit on Thread Reader App
threadreaderapp.com·3h
🤖Anthropic Claude
Preview
Report Post
How To Use LLM-Powered Coding Assistants Safely: Risks & Best Practices
xebia.com·13h·
Discuss: Hacker News
🤖AI Coding Tools
Preview
Report Post
What Deep Learning Theory Teaches Us About AI Memory
dev.to·1d·
Discuss: DEV
🧠Memory Models
Preview
Report Post
How IntelliNode Automates Complex Workflows with Vibe Agents
towardsdatascience.com·13h
🤖AI agents
Preview
Report Post
Show HN: Chat-DeepAI – DeepSeek pricing and getting-started guides (fan project)
chat-deepai.com·12h·
Discuss: Hacker News
🏗️AI Infrastructure
Preview
Report Post
How to build agentic AI when your data can’t leave the network
blog.logrocket.com·4d
💻Local LLMs
Preview
Report Post
Architecting Enterprise grade Multi‑Agent AI with AWS Strands & Amazon Bedrock AgentCore
dev.to·19h·
Discuss: DEV
🤖AI agents
Preview
Report Post