When Poetry Becomes a Weapon: How Researchers Broke Every Major AI With Verses
pub.towardsai.net·1d
🛡️AI Security
Preview
Report Post
Source: Image by the author.

An average 62% jailbreak success rate across 25 frontier models suggests AI safety may be built on foundations as fragile as a sonnet.

The $100 Billion Security System That Falls to Rhyme

The same neural networks that required billions in safety research, thousands of red-team hours, and elaborate “alignment” pipelines can be convinced to explain bomb-making with a well-crafted poem.

Not a sophisticated exploit. Not a zero-day. Poetry.

In November 2025, researchers from Italy’s Icaro Lab published a paper with a title that sounds like satire and reads like a red-team horror story: “Adversarial Poetry as a Univer...

Similar Posts

Loading similar posts...