Defending Against Model Weight Exfiltration Through Inference Verification
lesswrong.com·15h
🏗️LLM Infrastructure
Preview
Report Post

Published on December 15, 2025 3:26 PM GMT

Authors: Roy Rinberg, Adam Karvonen, Alex Hoover, Daniel Reuter, Keri Warr

Arxiv paper link

One Minute Summary

Anthropic has adopted upload limits to prevent model weight exfiltration. The idea is simple: model weights are very large, text outputs are small, so if we cap the output bandwidth, we can make model weight transfer take a long time. The problem is that inference servers…

Similar Posts

Loading similar posts...