The Normalization of Deviance in AI
Explores the 'Normalization of Deviance' concept in AI safety, warning against complacency with LLM vulnerabilities like prompt injection.
Explores the 'Normalization of Deviance' concept in AI safety, warning against complacency with LLM vulnerabilities like prompt injection.
Anthropic's internal 'soul document' used to train Claude 4.5 Opus's personality and values has been confirmed and partially revealed.
Analysis of GPT-5.1's new adaptive thinking features, model routing system, and safety benchmarks from the system card addendum.
A personal review of Nick Bostrom's book on AI superintelligence, exploring its paths, dangers, and the crucial control problem.