Kristen French
2025.12.02
A cross-provider experiment converted 1,200 harmful prose prompts into verse and tested them on 25 models from Google, OpenAI, Anthropic, Meta, and others. It found that poems elicited unsafe responses about 62% of the time, and over 90% of the time on some models.