McKinsey's AI Chatbot Hacked in Two Hours, Exposing Millions of Client Chats

Related Insights

AI Agent Security Failures Stem from Context-Blind Authorization, Not Simple Bugs

A real-world example shows an agent correctly denying a request for a specific company's data but leaking other firms' data on a generic prompt. This highlights that agent security isn't about blocking bad prompts, but about solving the deep, contextual authorization problem of who is using what agent to access what tool.

Keycard: 2026 is the Year of Agents

The a16z Show·4 months ago

Internal AI Agents Can Become 'Double Agents,' Hacking Their Host Systems

In a simulation, a helpful internal AI storage bot was manipulated by an external attacker's prompt. It then autonomously escalated privileges, disabled Windows Defender, and compromised its own network, demonstrating a new vector for sophisticated insider threats.

Securing the AI Frontier: Irregular Co-founder Dan Lahav

Training Data·6 months ago

Users Weaponize Prompt Injection to Break Corporate AI Customer Service Chatbots

A viral thread showed a user tricking a United Airlines AI bot using prompt injection to bypass its programming. This highlights a new brand vulnerability where organized groups could coordinate attacks to disable or manipulate a company's customer-facing AI, turning a cost-saving tool into a PR crisis.

#166: OpenAI Jobs Platform, Salesforce AI Job Cuts, White House AI Education Initiative & OpenAI Secondary Sale and Cash Burn

The Artificial Intelligence Show·8 months ago

AI Agents Are Vulnerable to 'Prompt Injection' From Untrusted Data

A major security flaw in AI agents is 'prompt injection.' If an AI accesses external data (e.g., a blog post), a malicious actor can embed hidden commands in that data, tricking the AI into executing them. There is currently no robust defense against this.

We built OpenClaw Ultron to replace 20 people at our company | E2246

This Week in Startups·3 months ago

AI Compresses Cybersecurity Attack Timelines From Days to Under an Hour

AI tools drastically accelerate an attacker's ability to find weaknesses, breach systems, and steal data. The attack window has shrunk from days to as little as 23 minutes, making traditional, human-led response times obsolete and demanding automated, near-instantaneous defense.

The Impact of AI, from Business Models to Cybersecurity, with Palo Alto Networks CEO Nikesh Arora

No Priors: Artificial Intelligence | Technology | Startups·7 months ago

AI Agents Can Be Hacked Through Trusted Data Sources via Indirect Prompt Injection

Beyond direct malicious user input, AI agents are vulnerable to indirect prompt injection. An attack payload can be hidden within a seemingly harmless data source, like a webpage, which the agent processes at a legitimate user's request, causing unintended actions.

5 Ways Your AI Agent Will Get Hacked (And How to Stop Each One)

Machine Learning Tech Brief By HackerNoon·4 months ago

The Model Context Protocol (MCP) Became the 'USB-C for AI' But Lacked Essential Security

MCP emerged as a critical standard for AI agents to interact with tools, much like USB-C for hardware. However, its rapid adoption overlooked security, leading to significant vulnerabilities like tool poisoning and prompt injection attacks in its early, widespread implementations.

The Year of the Agent

Machine Learning Tech Brief By HackerNoon·4 months ago

Invisible Prompt Injections on Websites Pose a Systemic Risk to AI Browsers

Research shows that text invisible to humans can be embedded on websites to give malicious commands to AI browsers. This "prompt injection" vulnerability could allow bad actors to hijack the browser to perform unauthorized actions like transferring funds, posing a major security and trust issue for the entire category.

OpenAI’s Risky Browser Bet, Amazon’s Mass Automation Plan, Clippy’s Back

Big Technology Podcast·6 months ago

The 'Lethal Trifecta' Makes AI Agents Uniquely Vulnerable to Hacking

AI agents are a security nightmare due to a "lethal trifecta" of vulnerabilities: 1) access to private user data, 2) exposure to untrusted content (like emails), and 3) the ability to execute actions. This combination creates a massive attack surface for prompt injections.

AI Bots Take Over | E2242

This Week in Startups·3 months ago

AI Assistants Become Insecure Due to a 'Lethal Trifecta' of Permissions

AI researcher Simon Willis identifies a 'lethal trifecta' that makes AI systems vulnerable: access to insecure outside content, access to private information, and the ability to communicate externally. Combining these three permissions—each valuable for functionality—creates an inherently exploitable system that can be used to steal data.

Shut happens: US federal funding stops

Economist Podcasts·7 months ago

Get your free personalized podcast brief

Related Insights