We scan new podcasts and send you the top 5 insights daily.
Similar to how an LLM uses a 'chain of thought' to reason, Genesis's model 'thinks' by iteratively refining an in-memory representation of a crystal structure. This process is guided by physics-based principles, significantly improving the final prediction's accuracy.
Genesis achieved its sub-one-angstrom accuracy not through one algorithmic trick, but by making it a core objective from the start. This obsessive focus on the right metric guided countless small, compounding decisions across data, infrastructure, and modeling.
Instead of building from scratch, ProPhet leverages existing transformer models to create unique mathematical 'languages' for proteins and molecules. Their core innovation is an additional model that translates between them, creating a unified space to predict interactions at scale.
The community standard of two-angstrom accuracy for protein-ligand predictions is insufficient. At that resolution, critical details like an aromatic ring's orientation can be wrong, rendering the model's output misleading for drug design. Genesis argues one-angstrom accuracy is the minimum for practical utility.
A key strategy for improving results from generative protein models is "inference-time scaling." This involves generating a vast number of potential structures and then using a separate, fine-tuned scoring model to rank them. This search-and-rank process uncovers high-quality solutions the model might otherwise miss.
Modern protein models use a generative approach (diffusion) instead of regression. Instead of predicting one "correct" structure, they model a distribution of possibilities. This better handles molecular dynamism and avoids averaging between multiple valid states, which is a flaw of regression models.
Large Language Models are limited because they lack an understanding of the physical world. The next evolution is 'World Models'—AI trained on real-world sensory data to understand physics, space, and context. This is the foundational technology required to unlock physical AI like advanced robotics.
The public database of protein structures (PDB) is small and grows slowly. To train more powerful models, Genesis leverages physics simulations to model small molecule behavior, creating a large, high-quality synthetic dataset that isn't possible for more complex protein-protein interactions.
EBMs are based on a fundamental principle in physics where systems naturally seek their lowest energy state (e.g., sitting on a couch when tired). The model maps all possible outcomes onto an 'energy landscape,' where the lowest points represent the most probable solutions. This avoids the expensive, token-by-token guessing game played by LLMs.
Periodic Labs doesn't use a single monolithic model. Instead, a powerful language model acts as a central coordinator or "copilot." It directs experiments by calling upon smaller, highly specialized, and more efficient neural nets (e.g., those with symmetry awareness for atomic systems) as tools.
Generative AI alone designs proteins that look correct on paper but often fail in the lab. DenovAI adds a physics layer to simulate molecular dynamics—the "jiggling and wiggling"—which weeds out false positives by modeling how proteins actually interact in the real world.