FlipLLM: Efficient Bit-Flip Attacks on Multimodal LLMs using Reinforcement Learning
arxiv.org·2d
⚔️Lean Tactics
Preview
Report Post

View PDF HTML (experimental)

Abstract:Generative Artificial Intelligence models, such as Large Language Models (LLMs) and Large Vision Models (VLMs), exhibit state-of-the-art performance but remain vulnerable to hardware-based threats, specifically bit-flip attacks (BFAs). Existing BFA discovery methods lack generalizability and struggle to scale, often failing to analyze the vast parameter space and complex interdependencies of modern foundation models in a reasonable time. This paper proposes FlipLLM, a reinforcement learning (RL) architecture-agnostic framework that formulates BFA discovery as a sequential decision-making problem. FlipLLM combines sensitivity-guided layer pruning with Q-learning t…

Similar Posts

Loading similar posts...