Instead of trying to control open-source AI models, which is intractable, the proposed strategy is to control the small, expensive-to-produce functional datasets they train on. This preserves the beneficial open-source ecosystem while preventing the dissemination of dangerous capabilities like viral design.
Models designed to predict and screen out compounds toxic to human cells have a serious dual-use problem. A malicious actor could repurpose the exact same technology to search for or design novel, highly toxic molecules for which no countermeasures exist, a risk the researchers initially overlooked.
The ease of finding AI "undressing" apps (85 sites found in an hour) reveals a critical vulnerability. Because open-source models can be fine-tuned for this purpose, technical filters from major labs like OpenAI are insufficient. The core issue is uncontrolled distribution, which makes this a societal-awareness challenge rather than a purely technical one.
China remains committed to open-weight models, seeing them as beneficial for innovation. Its primary safety strategy is to remove hazardous knowledge (e.g., bioweapons information) from the training data itself. This makes the public model inherently safer, rather than relying solely on post-training refusal mechanisms that can be circumvented.
The danger of AI creating harmful proteins lies not in the digital design but in physical synthesis. A protein sequence on a computer is harmless. The critical control point is the gene synthesis process. Therefore, biosecurity efforts should focus on providing advanced screening tools to synthesis providers.
Current biosecurity screens for threats by matching DNA sequences to known pathogens. However, AI can design novel proteins that perform a harmful function without any sequence similarity to existing threats. This necessitates new security tools that can predict a protein's function, a concept termed "defensive acceleration."
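The gap described above can be sketched in a few lines. The toy screener below matches orders against known threat sequences by k-mer overlap (a crude stand-in for real similarity search; all sequences are placeholder strings, not real pathogen data): a near-copy of a known threat is flagged, while a hypothetical novel design with the same function but no sequence similarity slips through.

```python
def kmer_set(seq: str, k: int = 4) -> set:
    """All overlapping k-mers in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def similarity(a: str, b: str, k: int = 4) -> float:
    """Jaccard similarity between two sequences' k-mer sets."""
    ka, kb = kmer_set(a, k), kmer_set(b, k)
    return len(ka & kb) / len(ka | kb) if ka | kb else 0.0

# Placeholder "known threat" sequence -- not real pathogen data.
KNOWN_THREATS = ["MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ"]

def screen(order: str, threshold: float = 0.5) -> bool:
    """Flag an order only if it resembles a known threat sequence."""
    return any(similarity(order, t) >= threshold for t in KNOWN_THREATS)

# A near-copy of a known threat is caught...
print(screen("MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVA"))   # True
# ...but a dissimilar sequence (imagine it has the same function) is not.
print(screen("GSHMLEDPVDNKFNKEQQNAFYEILHLPNLNEEQ"))  # False
```

This is exactly why the "defensive acceleration" argument calls for function-prediction tools: no similarity threshold can catch a design that shares no sequence with the database.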
Deep Vision's plan to publish the genomes of deadly viruses would effectively give the "killing power of a nuclear arsenal" to an estimated 30,000 unvetted individuals with synthetic biology skills. In the bio-age, openly publishing certain information can be a greater security threat than physical weapons.
Research on bio-foundation models like EVO2 and ESM3 shows that strategically excluding key datasets (e.g., sequences of viruses that infect humans) dramatically reduces a model's performance on dangerous tasks, often to random chance, without harming its useful scientific capabilities.
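The curation step this research points to happens before training, not after. A minimal sketch (the records and the `infects_human` tag are hypothetical; real curation of corpora for models like EVO2 would involve taxonomy databases, not a hand-written flag):

```python
# Toy corpus of annotated sequence records (entirely hypothetical).
TRAINING_CORPUS = [
    {"id": "seq-001", "taxon": "E. coli",     "infects_human": False},
    {"id": "seq-002", "taxon": "Influenza A", "infects_human": True},
    {"id": "seq-003", "taxon": "T4 phage",    "infects_human": False},
]

def curate(corpus: list) -> list:
    """Exclude sequences of human-infecting viruses before pretraining,
    so the hazardous capability is never learned in the first place."""
    return [r for r in corpus if not r["infects_human"]]

safe_corpus = curate(TRAINING_CORPUS)
print([r["id"] for r in safe_corpus])  # ['seq-001', 'seq-003']
```

The contrast with post-training refusals is the point: a refusal can be jailbroken, but a capability the model never acquired cannot be elicited.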
A biosecurity data-level (BDL) framework, modeled after biosafety levels for labs, would keep 99% of biological data open-access. Only the top 1% of data—that which links pathogen sequences to dangerous properties like transmissibility—would face restrictions like requiring use-approval.
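The 99/1 split above amounts to a classification rule over data records. A hedged sketch of what a BDL tagger might look like (the specific phenotype list beyond transmissibility is my assumption; the podcast only names the pathogen-sequence-to-dangerous-property link):

```python
def classify_bdl(record: dict) -> str:
    """Assign a biosecurity data level: 'restricted' only when a pathogen
    sequence is linked to a dangerous property; everything else stays open."""
    # "virulence" and "immune_evasion" are assumed examples, not from the source.
    dangerous = {"transmissibility", "virulence", "immune_evasion"}
    phenotypes = set(record.get("phenotypes", []))
    if record.get("is_pathogen") and dangerous & phenotypes:
        return "restricted"  # the ~1%: requires use-approval
    return "open"            # the ~99%: remains open-access

print(classify_bdl({"is_pathogen": True,  "phenotypes": ["transmissibility"]}))  # restricted
print(classify_bdl({"is_pathogen": False, "phenotypes": ["transmissibility"]}))  # open
print(classify_bdl({"is_pathogen": True,  "phenotypes": ["capsid_structure"]}))  # open
```

Mirroring biosafety levels, the restriction attaches to the link between sequence and dangerous function, not to pathogen sequences in general.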
When all major AI models are trained on the same internet data, they develop similar internal representations ("latent spaces"). This creates a monoculture where a single exploit or "memetic virus" could compromise all AIs simultaneously, arguing for the necessity of diverse datasets and training methods.
Valthos CEO Kathleen, a biodefense expert, warns that AI's primary threat in biology is asymmetry. It drastically reduces the cost and expertise required to engineer a pathogen. The primary concern is no longer just sophisticated state-sponsored programs but small groups of graduate students with lab access, massively expanding the threat landscape.