How a Bit Becomes a Story: Semantic Steering via Differentiable Fault Injection
arxiv.org·1d
💬Prompt Engineering
Preview
Report Post

View PDF HTML (experimental)

Abstract:Hard-to-detect hardware bit flips, from either malicious circuitry or bugs, have already been shown to make transformers vulnerable in non-generative tasks. This work, for the first time, investigates how low-level, bitwise perturbations (fault injection) to the weights of a large language model (LLM) used for image captioning can influence the semantic meaning of its generated descriptions while preserving grammatical structure. While prior fault analysis methods have shown that flipping a few bits can crash classifiers or degrade accuracy, these approaches overlook the semantic and linguistic dimensions of generative systems. In image captioning models, a single fl…

Similar Posts

Loading similar posts...