Apple's seemingly slow AI progress is likely a strategic bet that today's powerful cloud-based models will become efficient enough to run locally on devices within 12 months. This would allow them to offer powerful AI with superior privacy, potentially leapfrogging competitors.
Unlike competitors feeling pressure to build proprietary AI foundation models, Apple can simply partner with providers like Google. This reveals Apple's true moat isn't the model itself but its massive hardware distribution network, giving it leverage to integrate best-in-class AI without the high cost of in-house development.
The vast network of consumer devices represents a massive, underutilized compute resource. Companies like Apple and Tesla can leverage these devices for AI workloads when they're idle, creating a virtual cloud where users have already paid for the hardware (CapEx).
Models like Gemini 3 Flash show a key trend: making frontier intelligence faster, cheaper, and more efficient. The trajectory is for today's state-of-the-art models to become 10x cheaper within a year, enabling widespread, low-latency, and on-device deployment.
Apple isn't trying to build the next frontier AI model. Instead, their strategy is to become the primary distribution channel by compressing and running competitors' state-of-the-art models directly on devices. This play leverages their hardware ecosystem to offer superior privacy and performance.
Apple's historic commitment to user privacy prevented it from using the vast data pools competitors leveraged for AI. This created a technical disadvantage, forcing Apple to use its marketing prowess ('lipstick') to mask a technologically inferior AI product ('the pig').
The future of AI isn't just in the cloud. Personal devices, like Apple's future Macs, will run sophisticated LLMs locally. This enables hyper-personalized, private AI that can index and interact with your local files, photos, and emails without sending sensitive data to third-party servers, fundamentally changing the user experience.
While critics say Apple "missed AI," its strategy of partnering with Google for Gemini is a masterstroke. Apple avoids billions in CapEx, sidesteps brand-damaging AI controversies, and maintains control over the lucrative user interface, positioning itself to win the "agent of commerce" war.
A cost-effective AI architecture involves using a small, local model on the user's device to pre-process requests. This local AI can condense large inputs into an efficient, smaller prompt before sending it to the expensive, powerful cloud model, optimizing resource usage.
The biggest risk to the massive AI compute buildout isn't that scaling laws will break, but that consumers will be satisfied with a "115 IQ" AI running for free on their devices. If edge AI is sufficient for most tasks, it undermines the economic model for ever-larger, centralized "God models" in the cloud.
By licensing Google's Gemini for Siri, Apple is strategically avoiding the capital-intensive foundation model war. This allows them to focus resources on their core strength: silicon and on-device AI. The long-term vision is a future where Apple dominates the "edge," interoperating with cloud AIs.