Adversarial Poetry as a Universal Single-Turn Jailbreak Mechanism in LLMs

Computer Science > Computation and Language arXiv:2511.15304 (cs) [Submitted on 19 Nov 2025] Title:Adversarial Poetry as a Universal Single-Turn Jailbreak Mechanism in Large Language Models Authors:Piercosma Bisconti, Matteo Prandi, Federico Pierucci, Francesco Giarrusso, Marcantonio Bracale, Marcello Galisai, Vincenzo Suriani, Olga Sorokoletova, Federico Sartore, Daniele Nardi View a PDF of the paper titled Adversarial Poetry as a Universal Single-Turn Jailbreak Mechanism in Large Language Models, by Piercosma Bisconti and 9 other authors View PDF HTML (experimental) Abstract:We present evidence that adversarial poetry functions as a universal single-turn jailbreak technique for large language models (LLMs). Across 25 frontier proprietary and open-weight models, curated…

Read more on Hacker News