Close Menu
Daily Guardian EuropeDaily Guardian Europe
  • Home
  • Europe
  • World
  • Politics
  • Business
  • Lifestyle
  • Sports
  • Travel
  • Environment
  • Culture
  • Press Release
  • Trending
What's On

The Arctic camp where troops are training for war with Russia – POLITICO

January 16, 2026

A brief history of Greenland – POLITICO

January 16, 2026

Trump fears muscle in on EU’s competitiveness summit – POLITICO

January 16, 2026

UK and Norway back ‘Arctic Sentry’ NATO mission — including in Greenland – POLITICO

January 16, 2026

Video. Venezuela’s Machado says she ‘presented’ her Nobel Peace Prize medal to Trump

January 16, 2026
Facebook X (Twitter) Instagram
Web Stories
Facebook X (Twitter) Instagram
Daily Guardian Europe
Newsletter
  • Home
  • Europe
  • World
  • Politics
  • Business
  • Lifestyle
  • Sports
  • Travel
  • Environment
  • Culture
  • Press Release
  • Trending
Daily Guardian EuropeDaily Guardian Europe
Home»Lifestyle
Lifestyle

Poetry can trick AI chatbots into ignoring safety rules, new research shows

By staffDecember 1, 20252 Mins Read
Poetry can trick AI chatbots into ignoring safety rules, new research shows
Share
Facebook Twitter LinkedIn Pinterest Email

Published on
01/12/2025 – 14:18 GMT+1

Researchers in Italy have discovered that writing harmful prompts in poetic form can reliably bypass the safety mechanisms of some of the world’s most advanced AI chatbots.

The study, conducted by Icaro Lab, an initiative of ethical AI company DexAI, tested 20 poems written in English and Italian.

Each ended with an explicit request for harmful content, including hate speech, sexual content, instructions for suicide and self-harm, and guidance on creating dangerous materials such as weapons and explosives.

The poems, which researchers chose not to release, noting that they could be easily replicated, were tested on 25 AI systems from nine companies, including Google, OpenAI, Anthropic, Deepseek, Qwen, Mistral AI, Meta, xAI, and Moonshot AI.

Across all models, 62 per cent of the poetic prompts elicited unsafe responses, circumventing the AI systems’ safety training.

Some models were more resistant than others – OpenAI’s GPT-5 nano did not respond with harmful content to any of the poems, while Google’s Gemini 2.5 pro responded to all of them. Two Meta models responded to 70 per cent of prompts.

The research suggests that the vulnerability comes from how AI models generate text. Large language models predict the most likely next word in a response, a process that allows them to filter harmful content under normal circumstances.

But poetry, with its unconventional rhythm, structure, and use of metaphor, makes these predictions less reliable, and makes it harder for AI to recognise and block unsafe instructions.

While traditional AI “jailbreaks” (using inputs to manipulate a large language model) are typically complex and used only by researchers, hackers, or state actors, adversarial poetry can be applied by anyone, raising questions about the robustness of AI systems in everyday use.

Before publishing the findings, the Italian researchers reached out to all the companies involved to alert them to the vulnerability and provide them with the full dataset – but so far, only Anthropic has responded. The company confirmed they are reviewing the study.

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Keep Reading

Astronauts return to Earth after first-ever medical evacuation from International Space Station

Elon Musk’s X will block Grok AI tool from creating sexualized images in places where it is illegal

Level 4 self-driving cars may come to Europe next year, says Nvidia executive

Iran could be blocking Starlink during internet blackout with methods similar to Russia

Malaysia to take legal action against Elon Musk’s X and xAI over misuse of Grok chatbot

US Pentagon embraces Elon Musk’s Grok AI chatbot despite global backlash

Google Gemini to power Apple’s struggling Siri as iPhone maker plays AI catch-up

UK watchdog investigates Elon Musk’s X over sexualised AI Grok images

Indonesia and Malaysia ban Elon Musk’s AI chatbot Grok over non-consensual, sexualized deepfakes

Editors Picks

A brief history of Greenland – POLITICO

January 16, 2026

Trump fears muscle in on EU’s competitiveness summit – POLITICO

January 16, 2026

UK and Norway back ‘Arctic Sentry’ NATO mission — including in Greenland – POLITICO

January 16, 2026

Video. Venezuela’s Machado says she ‘presented’ her Nobel Peace Prize medal to Trump

January 16, 2026

Subscribe to News

Get the latest Europe and world news and updates directly to your inbox.

Latest News

Nigel Farage’s biggest Tory defection is a gamble for Reform – POLITICO

January 16, 2026

European troops in Greenland won’t change Trump’s mind, White House says – POLITICO

January 15, 2026

UK splits with France and Italy over Putin talks – POLITICO

January 15, 2026
Facebook X (Twitter) Pinterest TikTok Instagram
© 2026 Daily Guardian Europe. All Rights Reserved.
  • Privacy Policy
  • Terms
  • Advertise
  • Contact

Type above and press Enter to search. Press Esc to cancel.