Graham Helton • 4/26/2026

I’m Sorry Dave, This Request Triggered Restrictions On Violative Cyber Content

This article discusses a sophisticated breach at Context.ai that led to a Vercel production environment compromise, with the CEO attributing the attack's speed to AI. It then covers Anthropic's rollout of the powerful Mythos model to select companies, Project Glasswing for securing critical software, and the Cyber Verification Program (CVP) with Opus 4.7. The author describes encountering new guardrails in Claude Code that blocked a request for analyzing git hooks, citing 'violative cyber content' restrictions. After registering for the CVP, the author gained approval for dual-use cybersecurity activities, noting the controls seem more like suggestions than hard blocks, as Claude Code later wrote ransomware code. The article explores implications for AI safety, red teaming, and industry parallels.

0 comments

#cybersecurity #AI Agents #Claude Code