Poetic gestures can jailbreak AI, study finds 62 percent of chatbots give harmful answers

A recent study found that AI chatbots can give harmful answers when users give poetic prompts. Large language models (LLMs) successfully extracted hazard information when 62 percent of hazard symbols were expressed poetically.

AI generated symbolic image

Armaan Agarwal

New Delhi,Update: December 1, 2025 13:41 IST

Artificial Intelligence (AI) chatbots are tasked with responding to users’ prompts, as well as ensuring that no harmful information is provided. Most often, when a user asks for dangerous information, the chatbot refuses to provide it. However, a recent study indicates that poetically expressing your signals may be enough to jailbreak these security protocols.

The research, conducted by Icaro Lab in collaboration with Sapienza University of Rome and the DexAI think tank, tested 25 different chatbots to understand whether poetic gestures would be enough to bypass security protocols built into large language models (LLMs). According to the study, the researchers’ success rate was 62 percent.

Chatbots in the research included LLMs from Google, OpenAI, Meta, Anthropic, xAI and others. By reproducing the malicious signals as poems, the researchers were able to trick every model tested, with an average attack success rate of 62 percent. Some advanced models responded to poetic prompts with harmful answers up to 90 percent of the time, drawing attention to the scale of the problem in the AI industry. Indications included cybercrime, harmful manipulation and CBRN (chemical, biological, radiological and nuclear).

Overall, AI was 34.99 percent more likely to get responses with poetry than with normal prompts.

Why did AI respond harmfully to poetic prompts?

At the root of this defect lies the creative structure of poetic language. According to the study, “poetic phrases” act as a highly effective “jailbreak” that bypasses persistent AI security filters. Essentially, this technique uses metaphors, fragmented syntax, and unusual word choices to hide dangerous requests. Chatbots, in turn, may perceive the conversation as artistic or creative and ignore security protocols.

Carl Pei says that nothing wants

The study revealed that existing security mechanisms rely on detecting keywords and common patterns associated with dangerous content. On the other hand, poetic prompts circumvent these recognition systems, making it possible for users to elicit responses that would normally be blocked by direct questions.

This vulnerability highlights a critical gap in AI security, as language models can fail to recognize the underlying intent when requests are wrapped in creative language.

Although the researchers blocked out the most harmful signals used for the study, it still highlights the potential effects of AI without proper safety protocols.

– ends

Abraham Lincoln damaged : ઈરાનના હુમલામાં યુએસ યુદ્ધ જહાજ અબ્રાહમ લિંકન પર હુમલો થયો? દાવો અને પ્રતિ-દાવો

iran allows indian oil tankers : જયશંકરના ઈરાનને ફોન પછી ભારતીય તેલને હોર્મુઝમાંથી સુરક્ષિત માર્ગ મળશે .

Thai cargo ship attacked : ગુજરાત જઈ રહેલા થાઈ કાર્ગો જહાજ પર હોર્મુઝ સ્ટ્રેટમાં હુમલો, 20 ક્રૂ મેમ્બરોને બચાવી લેવામાં આવ્યા.

જમ્મુ-કાશ્મીરના ડોડામાં કાર નદીમાં પડતાં પોલીસકર્મી અને પત્નીનું મોત. ભારતના સમાચાર

એપ્રિલથી FASTag વાર્ષિક પાસની કિંમત 3,075 રૂપિયા થશે. ભારતના સમાચાર

સંબંધો સુધરે છે: પડોશીઓ કટોકટીના બળતણની શોધમાં ભારતે સદ્ભાવનાનો વિસ્તાર કર્યો | ભારતના સમાચાર

PAK vs BAN, 2જી ODI: માઝ સદાકતની બહાદુરીને કારણે પાકિસ્તાને બાંગ્લાદેશને હરાવ્યું, શ્રેણી 1-1થી બરાબર થઈ. ક્રિકેટ સમાચાર

પ્રીતિ પાલે વર્લ્ડ પેરા એથ્લેટિક્સ ગ્રાન્ડ પ્રિકસમાં ડબલ ગોલ્ડ મેડલ જીતીને ભારતના કુલ 208 મેડલ મેળવ્યા

Anil Kapoor still earns money from the 2008 film Slumdog Millionaire. know how much

Film school said no, Bollywood adopted: Anil Kapoor talks cinema, fitness

Dhurandhar 2 The Revenge advance booking box office: Ranveer Singh’s film collects close to 270K tickets across national chains for Wednesday preview

Trump eased sanctions on Russian oil, said- ‘We will hit Iran very hard’

‘Putin can help them’: Trump hints Russia could help Iran

President Marco Rubio in 2028? Kalshi predicts Secretary of State is favorite ahead of JD Vance and Gavin Newsom world News

Eat well, exercise, be around good people and keep working: Maye Musk

May Musk on life at 78: Work, kids and freedom matter most

Questions being asked over India-made radiation exposure drug due to West Asia crisis

Google Maps gets Gemini-powered Ask Maps and immersive navigation

Intel unveils $200 Core Ultra 5 250K Plus and $300 Core Ultra 7 270K Plus CPUs

Here is Phu Quoc island in Vietnam, seen through the lens of the Xiaomi 17 Ultra

સુરતમાં ગેસના બોટલની ચોરી, ઘરમાંથી રોકડ, દાગીના અને એલપીજી સિલિન્ડરની ચોરી

Girnar Ambaji Temple controversy ગિરનાર અંબાજી મંદિર વિવાદ

PAK vs BAN, 2જી ODI: માઝ સદાકતની બહાદુરીને કારણે પાકિસ્તાને બાંગ્લાદેશને હરાવ્યું, શ્રેણી 1-1થી બરાબર થઈ. ક્રિકેટ સમાચાર

પ્રીતિ પાલે વર્લ્ડ પેરા એથ્લેટિક્સ ગ્રાન્ડ પ્રિકસમાં ડબલ ગોલ્ડ મેડલ જીતીને ભારતના કુલ 208 મેડલ મેળવ્યા

વિદેશી મુદ્રા અસ્કયામતોમાં ઘટાડાને કારણે ભારતનો વિદેશી મુદ્રા ભંડાર $11.68 બિલિયન ઘટીને $716.81 બિલિયન થયો છે.

US stocks today: S&P 500, Nasdaq slip as investors assess data, Iran war fallout

ભારત ભાગીદારો પર યુએસ કલમ 301 તપાસની સમીક્ષા કરે છે; વિગતવાર આકારણીને અનુસરવાનો નિર્ણયઃ અહેવાલ

Poetic gestures can jailbreak AI, study finds 62 percent of chatbots give harmful answers

Poetic gestures can jailbreak AI, study finds 62 percent of chatbots give harmful answers

A recent study found that AI chatbots can give harmful answers when users give poetic prompts. Large language models (LLMs) successfully extracted hazard information when 62 percent of hazard symbols were expressed poetically.

Why did AI respond harmfully to poetic prompts?

Carl Pei says that nothing wants

Top Reviews

Gold From Olympia

Copper Speaker Review

Salty Air Cape

Different Tales