Jailbreaking in verse: how poetry loosens AI’s tongue
Researchers have discovered that styling prompts as poetry can significantly undermine the effectiveness of language models’ safety guardrails.
1 article
Researchers have discovered that styling prompts as poetry can significantly undermine the effectiveness of language models’ safety guardrails.