Even with AI-Secured Code, Hacking Won't Get Harder; Attack Surfaces Are Too Complex

Related Insights

AI Security Requires Proactive 'Outside-In' Research in Realistic Simulations

The rapid evolution of AI makes reactive security obsolete. The new approach involves testing models in high-fidelity simulated environments to observe emergent behaviors from the outside. This allows mapping attack surfaces even without fully understanding the model's internal mechanics.

Securing the AI Frontier: Irregular Co-founder Dan Lahav

Training Data·6 months ago

AI Model Attackers Have an Inherent Advantage Because the Attack Surface Is "Ever Expanding"

Defenders of AI models are "fighting against infinity" because as model capabilities and complexity grow, the potential attack surface area expands faster than it can be secured. This gives attackers a persistent upper hand in the cat-and-mouse game of AI security.

Jailbreaking AGI: Pliny the Liberator & John V on AI Red Teaming, BT6, and the Future of AI Security

Latent Space: The AI Engineer Podcast·5 months ago

AI Guardrails Offer False Security Against a Practically Infinite Attack Surface

Claiming a "99% success rate" for an AI guardrail is misleading. The number of potential attacks (i.e., prompts) is nearly infinite. For GPT-5, it's 'one followed by a million zeros.' Blocking 99% of a tested subset still leaves a virtually infinite number of effective attacks undiscovered.

The coming AI security crisis (and what to do about it) | Sander Schulhoff

Lenny's Podcast: Product | Career | Growth·4 months ago

AI Can Analyze an Entire API Surface to Exploit Vulnerabilities Across Siloed Teams

Unlike human attackers, AI can ingest a company's entire API surface to find and exploit combinations of access patterns that individual, siloed development teams would never notice. This makes it a powerful tool for discovering hidden security holes that arise from a lack of cross-team coordination.

The AI PM's Guide to Security - with Okta's VP of PM & AI, Jack Hirsch

Product Growth Podcast·7 months ago

AI Amplifies All Cyber Threats, from Script Kiddies to Nation-State Actors

AI tools aren't just lowering the bar for novice hackers; they are making experts more effective, enabling attacks at a greater scale across all stages of the "cyber kill chain." AI is a universal force multiplier for offense, making even powerful reverse engineers shockingly more effective.

The Great Security Update: AI ∧ Formal Methods with Kathleen Fisher of RAND & Byron Cook of AWS

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·4 months ago

AI Guardrails Fail Because You Cannot 'Patch' an AI's 'Brain'

Unlike traditional software where a bug can be patched with high certainty, fixing a vulnerability in an AI system is unreliable. The underlying problem often persists because the AI's neural network—its 'brain'—remains susceptible to being tricked in novel ways.

The coming AI security crisis (and what to do about it) | Sander Schulhoff

Lenny's Podcast: Product | Career | Growth·4 months ago

External AI Guardrails Are Like Checking IDs; They Can't Stop "Inside" Threats Like Jailbreaks

Current AI safety solutions primarily act as external filters, analyzing prompts and responses. This "black box" approach is ineffective against jailbreaks and adversarial attacks that manipulate the model's internal workings to generate malicious output from seemingly benign inputs, much like a building's gate security can't stop a resident from causing harm inside.

Controlling AI Models from the Inside

Practical AI·3 months ago

"Jailbreaking" AI Models to Extract Training Data Is an Emerging Hacking Vector

Hackers are exploiting AI models not just to write malicious code, but by circumventing safety protocols to extract sensitive or useful information embedded within the AI's training data. This represents a novel attack surface.

Legendary Hacker Matt Suiche on Cyberwar in the Age of AI

Odd Lots·2 months ago

AI Security Shifts Defenses from Outrunning One Bear to Fending Off a Thousand

The old security adage was to be better than your neighbor. AI attackers, however, will be numerous and automated, meaning companies can't just be slightly more secure than peers; they need robust defenses against a swarm of simultaneous threats.

Uncapped #39 | Daniele Perito from depthfirst

Uncapped with Jack Altman·4 months ago

The 'Lethal Trifecta' Makes AI Agents Uniquely Vulnerable to Hacking

AI agents are a security nightmare due to a "lethal trifecta" of vulnerabilities: 1) access to private user data, 2) exposure to untrusted content (like emails), and 3) the ability to execute actions. This combination creates a massive attack surface for prompt injections.

AI Bots Take Over | E2242

This Week in Startups·3 months ago

Get your free personalized podcast brief

Related Insights