Odysseus: Jailbreaking Commercial Multimodal LLM-integrated Systems via Dual Steganography
arxiv.org·3d
💾vintage computing
Preview
Report Post

View PDF HTML (experimental)

Abstract:By integrating language understanding with perceptual modalities such as images, multimodal large language models (MLLMs) constitute a critical substrate for modern AI systems, particularly intelligent agents operating in open and interactive environments. However, their increasing accessibility also raises heightened risks of misuse, such as generating harmful or unsafe content. To mitigate these risks, alignment techniques are commonly applied to align model behavior with human values. Despite these efforts, recent studies have shown that jailbreak attacks can circumvent alignment and elicit unsafe outputs. Currently, most existing jailbreak methods are tailored for…

Similar Posts

Loading similar posts...