Get your free personalized podcast brief

We scan new podcasts and send you the top 5 insights daily.

We don't fully understand how advanced AI models work. Creators don't program them with explicit knowledge but train them on vast datasets and then run experiments to discover their capabilities. This makes AI development more of a science—studying an unpredictable artifact—than traditional engineering, highlighting an inherent lack of control.

Related Insights

The future of AI is hard to predict because increasing a model's scale often produces 'emergent properties'—new capabilities that were not designed or anticipated. This means even experts are often surprised by what new, larger models can do, making the development path non-linear.

Unlike traditional software where features are explicitly coded, frontier AI systems are trained on vast datasets, leading to emergent abilities. Their internal mechanisms are not directly designed, which is why developers struggle to reliably instill intended goals and prevent unwanted behaviors.

The ambition to fully reverse-engineer AI models into simple, understandable components is proving unrealistic as their internal workings are messy and complex. Its practical value is less about achieving guarantees and more about coarse-grained analysis, such as identifying when specific high-level capabilities are being used.

Large language models are like "alien technology"; their creators understand the inputs and outputs but not the "why" of their learning process. This reality requires leaders to be vigilant about managing AI's limitations and unpredictability, such as hallucinations.

AI development is more like farming than engineering. Companies create conditions for models to learn but don't directly code their behaviors. This leads to a lack of deep understanding and results in emergent, unpredictable actions that were never explicitly programmed.

Vercel designer Pranati Perry advises viewing AI models as interns. This mindset shifts the focus from blindly accepting output to actively guiding the AI and reviewing its work. This collaborative approach helps designers build deeper technical understanding rather than just shipping code they don't comprehend.

Unlike traditional software development that starts with unit tests for quality assurance, AI product development often begins with 'vibe testing.' Developers test a broad hypothesis to see if the model's output *feels* right, prioritizing creative exploration over rigid, predefined test cases at the outset.

Building machines that learn from vast datasets leads to unpredictable outcomes. OpenAI's GPT-3, trained on text, spontaneously learned to write computer programs—a skill its designers did not explicitly teach it or expect it to acquire. This highlights the emergent and mysterious nature of modern AI.

Geoffrey Irving describes the training process at frontier labs as an impure 'mess.' It's an emergent system with hundreds of engineers, constantly changing datasets, and many ad-hoc checks, not a clean, theoretical process. New techniques don't simplify this; they just add another variable into the complex mix.

Unlike traditional software, large language models are not programmed with specific instructions. They evolve through a process where different strategies are tried, and those that receive positive rewards are repeated, making their behaviors emergent and sometimes unpredictable.