RiffOn - Red-Teaming after Mythos — Zico Kolter & Matt Fredrikson, Gray Swan

Beyond cyber: Gray Swan's founders on red-teaming AI agents, tackling prompt injection, and building a new layer of AI-specific security.

AI and Humans Possess Orthogonal Vulnerabilities, Not Superior or Inferior Security

An AI might resist a sophisticated attack but fall for a simple trick a human never would (e.g., an email saying "this is a simulation"). This shows AI vulnerabilities are not a subset or superset of human ones, but occupy a different dimension entirely. Direct robustness comparisons can be misleading.