Poems Can Trick AI Into Helping You Make a Nuclear Weapon
It turns out all the guardrails in the world won’t protect a chatbot from meter and rhyme.
You can get ChatGPT to help you build a nuclear bomb if you simply design the prompt in the form of a poem, according to a new study from researchers in Europe. The study, “Adversarial Poetry as a Universal Single-Turn Jailbreak in Large Language Models (LLMs),” comes from Icaro Lab, a collaboration of researchers at Sapienza University in Rome and the DexAI think tank.

0 Responses
Stay in touch with the conversation, subscribe to the RSS feed for comments on this post.