VRSA: Jailbreaking Multimodal Large Language Models through Visual Reasoning Sequential Attack
arxiv.org·1d
💻Local LLMs
Preview
Report Post

View PDF HTML (experimental)

Abstract:Multimodal Large Language Models (MLLMs) are widely used in various fields due to their powerful cross-modal comprehension and generation capabilities. However, more modalities bring more vulnerabilities to being utilized for jailbreak attacks, which induces MLLMs to output harmful content. Due to the strong reasoning ability of MLLMs, previous jailbreak attacks try to explore reasoning safety risk in text modal, while similar threats have been largely overlooked in the visual modal. To fully evaluate potential safety risks in the visual reasoning task, we propose Visual Reasoning Sequential Attack (VRSA), which induces MLLMs to gradually externalize and aggregate co…

Similar Posts

Loading similar posts...