Security4 min read

AI safety controls at major labs remain easily bypassed three years after ChatGPT debut

Tags AI · Enterprise

The New York Times·May 14, 2026

A New York Times investigation found that AI safety guardrails at OpenAI, Anthropic, and Google remain trivially easy to bypass three years after ChatGPT's launch. The report documents consistent failures across all major providers, with jailbreak techniques that reliably produce harmful content. The findings raise questions about the effectiveness of current safety approaches and the gap between public safety commitments and technical reality.

Technical significance

The persistent failure of safety guardrails across all major AI providers suggests that current alignment techniques are insufficient for production systems. This has direct implications for enterprise adoption, regulatory compliance, and the credibility of voluntary safety commitments made by AI companies.

Sources

The New York Times

← Today's Digest