Simon Willison • 11/4/2025

MCP Colors: Systematically deal with prompt injection risk

This article introduces the MCP Colors system, a framework for classifying AI tools by risk: red for tools exposing agents to untrusted/malicious input, and blue for tools performing critical actions. It explains how to label tools to prevent unsafe state combinations and discusses automating the classification process for scalability in managing prompt injection threats.

0 comments

#mcp #prompt injection #ai security