Unlocking LLMs: Privacy-First Inference for Everyone by Arvind Sundararajan
dev.toยท1dยท
Discuss: DEV
๐Ÿ Self-hosted AI
The Unseen Variable: Why Your LLM Gives Different Answers (and How We Can Fix It)
hackernoon.comยท16h
๐Ÿ—๏ธAI Infrastructure
LLM in the Middle: A Systematic Review of Threats and Mitigations to Real-World LLM-based Systems
arxiv.orgยท21h
๐Ÿ Self-hosted AI
Google releases VaultGemma, its first privacy-preserving LLM
arstechnica.comยท1dยท
Discuss: Hacker News
๐Ÿ Self-hosted AI
Unlocking LLMs: Secure Inference for the Rest of Us
dev.toยท2dยท
Discuss: DEV
๐Ÿ Self-hosted AI
Lockdown LLMs: Unleashing AI Power While Safeguarding User Privacy
dev.toยท1dยท
Discuss: DEV
๐Ÿ Self-hosted AI
Unlocking LLMs: Secure, Efficient Inference for Everyone
dev.toยท3dยท
Discuss: DEV
๐Ÿ Self-hosted AI
Shielded Minds: Unleashing Private LLM Inference by Arvind Sundararajan
dev.toยท1dยท
Discuss: DEV
๐Ÿ Self-hosted AI
Erase and Rewind: Precise LLM Memory Manipulation for Safer AI by Arvind Sundararajan
dev.toยท17hยท
Discuss: DEV
๐Ÿ Self-hosted AI
AQUA: Attention via QUery mAgnitudes for Memory and Compute Efficient Inference in LLMs
arxiv.orgยท21h
๐Ÿ—๏ธAI Infrastructure
Google Releases VaultGemma, Its First Privacy-Preserving LLM
yro.slashdot.orgยท11h
๐Ÿ Self-hosted AI
Private LLM Inference: Democratizing AI with Ciphertext Computations
dev.toยท3dยท
Discuss: DEV
๐Ÿ Self-hosted AI
The Case for Compact AI โ€“ Communications of the ACM
dl.acm.orgยท17hยท
Discuss: Hacker News
๐Ÿ—๏ธAI Infrastructure
LAVa: Layer-wise KV Cache Eviction with Dynamic Budget Allocation
arxiv.orgยท1d
๐Ÿ—๏ธAI Infrastructure
AI Unleashed: Secure LLM Inference for Everyone
dev.toยท2dยท
Discuss: DEV
๐Ÿ Self-hosted AI
Analog IMC Attention Mechanism For Fast And Energy-Efficient LLMs (FZJ, RWTH Aachen)
semiengineering.comยท1d
๐Ÿง Neuromorphic Hardware
AI Obfuscation: Shielding Predictions with Uncertainty
dev.toยท7hยท
Discuss: DEV
๐Ÿ Self-hosted AI
No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes
arxiv.orgยท21h
๐Ÿ Self-hosted AI
I built an LLM from Scratch in Rust (Just ndarray and rand)
github.comยท2dยท
๐Ÿ—๏ธAI Infrastructure