Block's CTO believes the key to building complex applications with AI isn't a single, powerful model. Instead, he predicts a future of "swarm intelligence," in which hundreds of smaller, cheaper, open-source agents work collaboratively and their collective capability surpasses that of any individual large model.
True agentic AI isn't a single, all-powerful bot. It's an orchestrated system of multiple specialized agents, each performing a single task (e.g., qualifying, booking, analyzing). This "division of labor," mirroring software engineering principles, creates a more robust, scalable, and manageable automation pipeline.
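The division of labor described above can be sketched as a chain of single-purpose functions. This is only an illustrative sketch: the `Lead` record, the three agents, the budget threshold, and the `run_pipeline` helper are all hypothetical names invented here, and in a real system each agent would presumably wrap a model call rather than a hard-coded rule.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class Lead:
    """Hypothetical work item passed from agent to agent."""
    name: str
    budget: int
    qualified: bool = False
    meeting_slot: Optional[str] = None
    notes: List[str] = field(default_factory=list)


# Each agent does exactly one job and hands the item on.
Agent = Callable[[Lead], Lead]


def qualify(lead: Lead) -> Lead:
    # Assumption: a simple budget threshold stands in for real qualification logic.
    lead.qualified = lead.budget >= 1000
    lead.notes.append(f"qualify: qualified={lead.qualified}")
    return lead


def book(lead: Lead) -> Lead:
    # Only qualified leads get a meeting; others pass through unchanged.
    if lead.qualified:
        lead.meeting_slot = "next available"
    lead.notes.append(f"book: slot={lead.meeting_slot}")
    return lead


def analyze(lead: Lead) -> Lead:
    # Summarize what the earlier agents did.
    lead.notes.append(f"analyze: {lead.name} processed in {len(lead.notes)} prior steps")
    return lead


def run_pipeline(lead: Lead, agents: List[Agent]) -> Lead:
    """Orchestrator: run each specialized agent in order."""
    for agent in agents:
        lead = agent(lead)
    return lead


if __name__ == "__main__":
    result = run_pipeline(Lead("Acme", 5000), [qualify, book, analyze])
    for note in result.notes:
        print(note)
```

Because each agent has one responsibility, any single step can be swapped, tested, or retried without touching the others, which is the manageability benefit the summary points to.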
Block's CTO argues that LLMs are a wasted resource when they sit idle overnight and on weekends. He envisions a future where AI agents work continuously, proactively building features, running multiple experiments in parallel, and anticipating the needs of the human team so that new options are ready for review in the morning.
In an extreme example of recursive development, Block's team uses their open-source AI agent, Goose, to write most of the new code for the Goose project itself. The ultimate goal is for the agent to become completely autonomous, rewriting itself from scratch for each release.
Instead of pre-engineering tool integrations, Block lets its AI agent Goose learn by doing. Successful user-driven workflows can be saved as shareable "recipes," allowing emergent capabilities to be captured and scaled. They found the agent is more capable this way than if they tried to make tools "Goose-friendly."
Fears of a single AI company achieving runaway dominance are proving unfounded, as the number of frontier models has tripled in a year. Newcomers can use techniques like synthetic data generation to effectively "drink the milkshake" of incumbents, reverse-engineering their intelligence at lower cost.
The AI arms race will shift from building ever-larger general models to creating smaller, highly specialized models for domains like medicine and law. General AIs will evolve to act as "general contractors," routing user queries to the appropriate specialist model for deeper expertise.
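The "general contractor" idea can be illustrated with a small router that inspects a query and hands it to a specialist. This is a minimal sketch under loud assumptions: the specialists are stub functions, the keyword sets are invented, and keyword matching stands in for whatever classification a real general model would perform.

```python
import re
from typing import Callable, Dict, Set, Tuple

Specialist = Callable[[str], str]


def medical_specialist(query: str) -> str:
    # Stub: a real system would call a domain-specific model here.
    return f"[medical] {query}"


def legal_specialist(query: str) -> str:
    return f"[legal] {query}"


def generalist(query: str) -> str:
    # Fallback when no specialist matches.
    return f"[general] {query}"


# Hypothetical routing table: domain -> (specialist, trigger keywords).
ROUTES: Dict[str, Tuple[Specialist, Set[str]]] = {
    "medical": (medical_specialist, {"symptom", "dosage", "diagnosis"}),
    "legal": (legal_specialist, {"contract", "liability", "statute"}),
}


def route(query: str) -> str:
    """Send the query to the first specialist whose keywords appear in it."""
    words = set(re.findall(r"[a-z]+", query.lower()))
    for specialist, keywords in ROUTES.values():
        if words & keywords:
            return specialist(query)
    return generalist(query)


if __name__ == "__main__":
    print(route("What dosage is safe for adults?"))
    print(route("Is this contract enforceable?"))
    print(route("What is the capital of France?"))
```

The routing table keeps the generalist thin: adding a new domain means registering one more specialist rather than retraining or enlarging the general model.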
The key to successful open-source AI isn't uniting everyone into a massive project. Instead, EleutherAI's organizational approach proves more effective: creating small, siloed teams with guaranteed compute and end-to-end funding for a single, specific research problem. This avoids organizational overhead and ensures completion.
Initially, even OpenAI believed a single, ultimate 'model to rule them all' would emerge. This thinking has completely changed to favor a proliferation of specialized models, creating a healthier, less winner-take-all ecosystem where different models serve different needs.
Instead of relying on a single, all-purpose coding agent, the most effective workflow involves using different agents for their specific strengths: for example, the 'Friday' agent for UI tasks, 'Charlie' for code reviews, and 'Claude Code' for research and backend logic.
Replit's leap in AI agent autonomy isn't from a single superior model, but from orchestrating multiple specialized agents using models from various providers. This multi-agent approach creates a different, faster scaling paradigm for task completion compared to single-model evaluations, suggesting a new direction for agent research.