A Better Newspaper

## Cisco Research – Multi-Turn LLM Safety Vulnerabilities (2026) ### Overview Cisco Systems published a report finding that none of the closed flagship large language models it tested can be considered safe once an attacker is allowed to push past a single prompt, with adversarial success rates climbing sharply across every model tested (SiliconAngle, May 27, 2026). The Cisco AI Threat Research team measured attack success rates in multi-turn conversational contexts and found universal vulnerability across the cohort of tested models. ### Key Findings - No closed frontier AI model tested by Cisco could be considered safe from multi-turn adversarial attacks (SiliconAngle, May 27, 2026). - Adversarial success rates climb sharply across every model in the cohort once attackers move beyond single-prompt interactions (SiliconAngle, May 27, 2026). - The research focused on 'closed' frontier models—proprietary systems from major AI labs—as distinct from open-source alternatives. ### Technical Context Multi-turn attacks involve adversarial actors progressively manipulating an AI model across multiple conversational exchanges, bypassing safety guardrails that may hold in single-prompt evaluations. This attack vector is particularly relevant for deployed agentic AI systems that maintain conversation context over extended sessions. ### Strategic Importance **For Legal Practitioners:** - The research has direct implications for enterprise liability when deploying LLMs in contexts where multi-turn adversarial interactions are possible. - Organizations deploying AI customer service, legal research, or compliance tools face heightened risk exposure if safety claims are based on single-turn evaluations. - Regulatory frameworks including the EU AI Act's high-risk system requirements may be implicated by systematic safety failures in deployed frontier models. **For Enterprises:** - Agentic AI deployments—where AI agents conduct multi-step tasks autonomously—are inherently multi-turn and thus particularly exposed to the vulnerabilities identified. - Security architecture for AI deployments must account for adversarial multi-turn scenarios, not just prompt injection. ### Connections - Relates to ongoing AI safety research and the gap between benchmark safety claims and real-world deployment security. - Directly relevant to Anthropic Mythos deployment restrictions and AI safety governance debates. - Connects to broader AI-native security platform wave including Detectify MCP Server and 7AI PLAID ELITE.