Model Quantization, Inference Optimization, GGUF Format, Privacy-preserving AI

No, Small Models Are Not the "Budget Option" (English)
mostlylucid.net·6h
🚀MLOps
Build a Privacy-First AI Text Classifier: No Servers, No APIs. Master the AI Lifecycle: Initialize, Train, and Deploy.
dev.to·1h·
Discuss: DEV
🏠Self-hosted AI
is this legit? Supposedly LangVAE straps a VAE + compression algorithm onto any LLM image, reduces resource requirements by up to...
arxiv.org·3d·
Discuss: r/LocalLLaMA
📱Edge AI
Yann LeCun’s VL-JEPA: The breakthrough that gives AI a "Mind's Eye" (instead of just a mouth).
hisohan.substack.com·13h·
Discuss: Substack
📱Edge AI
From Online Profile Paranoia to AI Complacency: Have We Really Stopped Caring About Privacy?
metamood.ai·2d
🏠Self-hosted AI
Introducing the XLab AI Security Guide
lesswrong.com·14h
🛡️Computer Security
GateBreaker: Gate-Guided Attacks on Mixture-of-Expert LLMs
arxiv.org·3d
🏠Self-hosted AI
Running Local LLMs in Game Engines - Here's My Journey with Godot + Ollama
dev.to·16h·
Discuss: DEV
☁️Serverless Rust
Show HN: Why is ML inference still so ad-hoc in practice?
news.ycombinator.com·1d·
Discuss: Hacker News
🚀MLOps
The Transformer Architecture: A Deep Dive into How LLMs Actually Work
dev.to·11h·
Discuss: DEV
🤖Transformers
vagos/llm-grep: Match lines using both classic and semantic regular expressions with LLM.
github.com·1d·
Discuss: Lobsters
📝NLP
A Linux User’s Approach to Local, Privacy-Respecting Image Editing using Local AI Model
reddit.com·2d·
Discuss: r/linux
🏠Self-hosted AI
Latest Trends in Large Language Models (LLMs)
dev.to·1h·
Discuss: DEV
🎙️Whisper
Real Time Detection and Quantitative Analysis of Spurious Forgetting in Continual Learning
arxiv.org·3d
📝Parser Combinators
Study: Shrinking AI memory boosts accuracy
ed.ac.uk·4d·
Discuss: Hacker News
📱Edge AI
TRUNAJOD: A text complexity library for text analysis built on spaCy — TRUNAJOD 0.1.1 documentation
trunajod20.readthedocs.io·16h
📝NLP
Network Trimming: A Data-Driven Neuron Pruning Approach towards Efficient Deep Architectures
dev.to·1h·
Discuss: DEV
📱Edge AI
How To Use LLM-Powered Coding Assistants Safely: Risks & Best Practices
xebia.com·18h·
Discuss: Hacker News
🤖AI Coding Tools
Book Review: Why Machines Learn
philippdubach.com·1d·
Discuss: Hacker News
📱Edge AI
Optimizing LLM inference on Amazon SageMaker AI with BentoML’s LLM-Optimizer
aws.amazon.com·3d
🏗️AI Infrastructure