
Criteo builds multiple specialized foundation models (for products, user timelines, etc.) rather than a single monolithic one. The embeddings from these models are made available across the company, serving as a "warm start" that accelerates development and improves the performance of new AI products.
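The "warm start" pattern can be sketched in a few lines: a new downstream product pulls ready-made embeddings from the shared models instead of engineering features from raw logs. The embedding store, IDs, and dimensions below are illustrative assumptions, not Criteo's actual setup.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical embedding store: in practice these rows would be served by the
# shared product and user-timeline foundation models, not sampled randomly.
EMB = {
    "user":    {f"u{i}": rng.normal(size=32) for i in range(3)},
    "product": {f"p{i}": rng.normal(size=16) for i in range(3)},
}

def warm_start_features(user_id: str, product_id: str) -> np.ndarray:
    """Build the input vector for a *new* downstream model by concatenating
    pretrained embeddings instead of hand-engineering features from scratch."""
    return np.concatenate([EMB["user"][user_id], EMB["product"][product_id]])

x = warm_start_features("u0", "p1")
print(x.shape)  # (48,)
```

The new product's model only has to learn on top of these 48 pretrained dimensions, which is what makes the start "warm".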

Related Insights

The true power of the AI application layer lies in orchestrating multiple specialized foundation models. Users want a single interface (like Cursor for coding) that intelligently routes tasks to the best model (e.g., Gemini for front-end, Codex for back-end), creating value through aggregation and workflow integration.

Stripe avoids costly system rebuilds by treating its new payments foundation model as a modular component. Its powerful embeddings are simply added as new features to many existing ML classifiers, instantly boosting their performance with minimal engineering effort.
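The Stripe pattern is just feature augmentation: concatenate the foundation model's embedding onto each classifier's existing feature vector and retrain. The synthetic data below is an assumption built so the embedding carries signal the hand-built features miss; the numbers are illustrative, not Stripe's.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(1)
n = 2000

# Hypothetical setup: hand-built features capture part of the label signal;
# a pretrained payments embedding (random here) captures the rest.
hand_feats = rng.normal(size=(n, 10))
payment_emb = rng.normal(size=(n, 24))
logit = hand_feats[:, 0] + payment_emb @ rng.normal(size=24)
y = (logit + rng.normal(scale=0.5, size=n) > 0).astype(int)

Xh_tr, Xh_te, Xe_tr, Xe_te, y_tr, y_te = train_test_split(
    hand_feats, payment_emb, y, random_state=0)

# Existing classifier, untouched architecture: only the input grows.
base = LogisticRegression(max_iter=1000).fit(Xh_tr, y_tr)
aug = LogisticRegression(max_iter=1000).fit(np.hstack([Xh_tr, Xe_tr]), y_tr)

base_acc = base.score(Xh_te, y_te)
aug_acc = aug.score(np.hstack([Xh_te, Xe_te]), y_te)
print(base_acc, aug_acc)
```

No rebuild is needed: the classifier class, training loop, and serving path stay the same, which is the "minimal engineering effort" point.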

Instead of being a weakness, Cursor's reliance on multiple foundation models is a key strength. With 50% of developers switching model families daily, this approach allows Cursor to benefit from every improvement in any underlying model. This creates a compounding product flywheel, making the application layer an index of the entire AI ecosystem's progress.

Low-Rank Adaptation (LoRA) allows a single base AI model to be efficiently fine-tuned into multiple distinct specialist models. This is a powerful strategy for companies needing varied editing capabilities, such as for different client aesthetics, without the high cost of training and maintaining separate large models.

Rather than committing to a single LLM provider like OpenAI or Gemini, Hux uses multiple commercial models. They've found that different models excel at different tasks within their app. This multi-model strategy allows them to optimize for quality and latency on a per-workflow basis, avoiding a one-size-fits-all compromise.
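In practice a per-workflow strategy often reduces to a routing table that records which model benchmarked best for each task. The workflows and model names below are hypothetical placeholders, not Hux's actual configuration.

```python
# Hypothetical routing table: each workflow maps to the model that won
# that workflow's quality/latency benchmark, plus the reason it was chosen.
ROUTES = {
    "summarize_call": {"model": "fast-small-model", "reason": "latency"},
    "draft_reply":    {"model": "frontier-model-a", "reason": "quality"},
    "extract_fields": {"model": "frontier-model-b", "reason": "structured output"},
}

def pick_model(workflow: str) -> str:
    """Route a task to its best-fit model instead of forcing every task
    through a single provider."""
    route = ROUTES.get(workflow)
    if route is None:
        raise ValueError(f"unknown workflow: {workflow}")
    return route["model"]

print(pick_model("summarize_call"))
```

Because the table is data rather than code, re-benchmarking after a new model release only means editing an entry, which is what makes the multi-model strategy cheap to maintain.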

By making different foundation models (like Gemini and Claude) collaborate, developers can achieve superior outcomes. One model may know a trick the other does not, such as using a free RSS feed instead of a costly API, yielding far more efficient and creative solutions than either model could produce alone.

The initial AI rush for every company to build proprietary models is over. The new winning strategy, seen with firms like Adobe, is to leverage existing product distribution by integrating multiple best-in-class third-party models, enabling faster and more powerful user experiences.

The belief that a single, god-level foundation model would dominate has proven false. Horowitz points to successful AI applications like Cursor, which uses 13 different models. This shows that value lies in the complex orchestration and design at the application layer, not just in having the largest single model.

Criteo's models moved from manually crafted, high-dimensional sparse feature vectors (e.g., a hashed space of 2^12 features) fed to linear models, to dense vectors (a few hundred dimensions) computed automatically by deep learning. This shift eliminated manual feature engineering and improved model adaptability.
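The contrast can be made concrete: the old style hashes raw tokens into a large, mostly-zero vector, while the new style looks tokens up in a learned embedding table and gets a small dense vector. The table here is random and the token names invented; it stands in for a trained deep model.

```python
import numpy as np

# Old style: hand-crafted sparse features via the hashing trick.
DIM_SPARSE = 2 ** 12                      # matches the 2^12 feature space above

def sparse_features(tokens):
    """One mostly-zero indicator vector; crosses and transforms were hand-made."""
    v = np.zeros(DIM_SPARSE)
    for t in tokens:
        v[hash(t) % DIM_SPARSE] = 1.0
    return v

# New style: a dense vector from a learned embedding table (random here,
# standing in for a deep model's learned representation).
rng = np.random.default_rng(0)
EMB = rng.normal(size=(DIM_SPARSE, 128))  # 128-d dense, "few hundred" scale

def dense_features(tokens):
    idx = [hash(t) % DIM_SPARSE for t in tokens]
    return EMB[idx].mean(axis=0)

toks = ["user:42", "product:shoes", "country:FR"]
print(sparse_features(toks).shape, dense_features(toks).shape)
```

The downstream model now consumes 128 informative dimensions instead of 4096 mostly-empty ones, and the representation improves whenever the embedding model is retrained, with no feature engineering.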

Instead of offering a model selector, creating a proprietary, branded model allows a company to chain different specialized models for various sub-tasks (e.g., search, generation). This not only improves overall performance but also provides business independence from the pricing and launch cycles of a single frontier model lab.