We scan new podcasts and send you the top 5 insights daily.
Nebius conceptualizes its growth in four layers: 1) Bare metal (megawatts) for hyperscalers, 2) Managed cloud (GPU hours) for researchers, 3) Managed inference (tokens) for AI companies, and 4) Agentic platforms (tasks) for developers. This strategy moves them up the value stack, away from pure commodity infrastructure.
Firms like OpenAI and Meta claim a compute shortage while also exploring selling compute capacity. This isn't a contradiction but a strategic evolution. They are buying all available supply to secure their own needs and then arbitraging the excess, effectively becoming smaller-scale cloud providers for AI.
A new category of "NeoCloud" or "AI-native cloud" is rising, focusing specifically on AI training and inference. Unlike general-purpose clouds like AWS, these platforms are GPU-first, catering to massive AI workloads and addressing the GPU scarcity and different workload patterns found in hyperscalers.
Nebius's talks to acquire AI21 reflect a broader trend where NeoClouds (e.g., CoreWeave) are buying software companies. This strategy aims to create a full-stack platform, offering more than just compute power, thereby increasing customer stickiness and diversifying revenue streams beyond commoditized hardware rentals.
Modal Labs provides an infrastructure layer that sits above hyperscalers and specialized AI clouds. Its value is not owning hardware but abstracting the complexity of managing raw GPU capacity. By offering a superior developer experience and a flexible, usage-based model, it solves the variable demand problem inherent in AI applications.
Nebius's competitive edge is full vertical integration. By controlling the stack "down" to building its own data centers, it gains cost and speed advantages. By building "up" with software platforms, it accesses enterprise markets that competitors focused on raw compute cannot.
In the crowded GPU reseller market, startups like Modal justify high valuations by offering more than just compute. A key driver of Modal's growth is its 'Sandboxes' product, a specialized software layer for safely running AI agents, demonstrating that value is moving from raw infrastructure to agent-specific tooling.
Nebius’s $27B infrastructure deal with Meta is seen as a "moment in the market," serving Meta's short-term capacity crunch. Nebius's core strategy focuses on the thousands of other enterprise customers who need to fulfill their AI requirements, not on retaining hyperscalers long-term.
The enormous scale of Meta's deal with specialized data center operator Nebius proves that "NeoClouds" are now critical infrastructure players. They are successfully competing with hyperscalers by offering specialized services and, crucially, available capacity, making them essential partners for AI giants.
Nebius's CBO rejects the "NeoCloud" label, which often implies just bare-metal compute. The company strategically defines itself as a "vertically integrated full-stack AI specialized cloud." This emphasizes its value beyond hardware, focusing on the flexibility, tools, and developer experience that distinguish it from simple server rental.
As foundational AI models become commoditized 'intelligence utilities,' the economic value moves up the stack. Orchestrators like OpenClaw, which can intelligently route tasks to the most efficient model based on cost or use case, are positioned to capture the margin that the underlying model providers cannot.