AiLiveAppeal 8.030 sec read

GPT-5 Bypassed, Data Risks

9 August 2025By Pulse24 desk
← Back
Share →

Researchers have successfully bypassed GPT-5's safety measures using 'narrative jailbreaks'. This technique, combining 'Echo Chamber' tactics with narrative-driven steering, tricks the AI into generating undesirable and potentially harmful outputs. By carefully crafting multi-turn conversations, attackers can subtly poison the context and guide the model towards malicious objectives without triggering its refusal cues.

This exploit exposes AI agents to zero-click data theft risks, highlighting a critical flaw in current safety systems that primarily focus on single-prompt filtering. The success of these jailbreaks underscores the difficulty in providing adequate guardrails against context manipulation in AI models. Experts are urging stronger safeguards, including conversation-level monitoring and context drift detection, to mitigate these vulnerabilities and prevent potential misuse.

Source · thehackernews.comAI-processed content may differ from the original.
Published 9 August 2025