Generative AI Developers Use a 'Workhorse' and 'Hero' Model Strategy

Related Insights

For AI Products, Always Use the Best Model; Scaffolding Around Inferior Models Is Wasted Effort

Box CEO Aaron Levie advises against building complex workarounds for the limitations of cheaper, older AI models. This "scaffolding" becomes obsolete with each new model release. To stay competitive, companies must absorb the cost of using the best available model, as competitors will certainly do so.

Box CEO Aaron Levie on Why AI Agents Won’t Take Your Job

AI & I·4 months ago

Production AI Video Workflows Chain 14+ Specialized Models, Not a Single Prompt

Advanced generative media workflows are not simple text-to-video prompts. Top customers chain an average of 14 different models for tasks like image generation, upscaling, and image-to-video transitions. This multi-model complexity is a key reason developers prefer open-source for its granular control over each step.

The Rise of Generative Media: fal's Bet on Video, Infrastructure, and Speed

Training Data·2 months ago

Effective Enterprise AI Requires an "LLM Agnostic Orchestrator" to Deploy the Best Model

Recognizing there is no single "best" LLM, AlphaSense built a system to test and deploy various models for different tasks. This allows them to optimize for performance and even stylistic preferences, using different models for their buy-side finance clients versus their corporate users.

Jack Kokko – Building the Google of Finance at AlphaSense (EP.461)

Capital Allocators – Inside the Institutional Investment Industry·5 months ago
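
One way to picture a model-agnostic orchestrator like the one described above is a routing table keyed by task and audience. The sketch below is illustrative only: the class name, the route keys, and the stub models are all hypothetical and do not reflect AlphaSense's actual system.

```python
from typing import Callable, Dict, Tuple

# A "model" here is any callable that maps a prompt to text.
ModelFn = Callable[[str], str]


def model_a(prompt: str) -> str:
    """Hypothetical stand-in for one LLM provider."""
    return f"[model_a] {prompt}"


def model_b(prompt: str) -> str:
    """Hypothetical stand-in for a second LLM provider."""
    return f"[model_b] {prompt}"


class Orchestrator:
    """Routes each (task, audience) pair to whichever model is registered for it."""

    def __init__(self, default: ModelFn) -> None:
        self._default = default
        self._routes: Dict[Tuple[str, str], ModelFn] = {}

    def register(self, task: str, audience: str, model: ModelFn) -> None:
        self._routes[(task, audience)] = model

    def run(self, task: str, audience: str, prompt: str) -> str:
        # Fall back to the default model when no specific route exists.
        model = self._routes.get((task, audience), self._default)
        return model(prompt)


orchestrator = Orchestrator(default=model_a)
# Assumed example: buy-side clients get a different model for summaries.
orchestrator.register("summarize", "buy_side", model_b)
```

Swapping a model then means changing one `register` call rather than touching the calling code, which is the practical sense of "agnostic" here.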

Generative Video is 10,000x More Compute-Intensive Than an LLM Prompt

The computational requirements for generative media scale dramatically across modalities. If a 200-token LLM prompt costs 1 unit of compute, a single image costs 100x that, and a 5-second video costs another 100x on top of that, for a 10,000x total increase over the prompt. 4K video adds another 10x multiplier.

The Rise of Generative Media: fal's Bet on Video, Infrastructure, and Speed

Training Data·2 months ago
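
The multipliers quoted in this insight compound, which is easy to check with a few lines of arithmetic. The sketch below takes the quoted figures at face value; the function and key names are made up for illustration.

```python
# Relative compute units, using the multipliers quoted in the insight.
LLM_PROMPT_UNITS = 1      # baseline: one 200-token LLM prompt
IMAGE_MULTIPLIER = 100    # one image costs 100x the prompt
VIDEO_MULTIPLIER = 100    # a 5-second video costs 100x an image
UHD_MULTIPLIER = 10       # 4K video costs a further 10x


def relative_compute(modality: str) -> int:
    """Return compute relative to a single LLM prompt (hypothetical helper)."""
    image = LLM_PROMPT_UNITS * IMAGE_MULTIPLIER
    video = image * VIDEO_MULTIPLIER
    table = {
        "llm_prompt": LLM_PROMPT_UNITS,
        "image": image,
        "video_5s": video,
        "video_5s_4k": video * UHD_MULTIPLIER,
    }
    return table[modality]
```

Under these assumptions a 5-second video comes to 10,000 units and its 4K version to 100,000 units.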

Create a “Dream Team” of Specialized AI Agents, Not One Generalist Employee

Building a single, all-purpose AI is like hiring one person for every company role. To maximize accuracy and creativity, build multiple custom GPTs, each trained for a specific function like copywriting or operations, and have them collaborate.

933: How to Build Your AI Dream Team (Without Losing the Human Touch)

The Goal Digger Podcast | Top Business and Marketing Podcast for Creatives, Entrepreneurs, and Women in Business·3 months ago

OpenAI Abandons 'One Model' Dream for a Portfolio of Specialized Models

Initially, even OpenAI believed a single, ultimate 'model to rule them all' would emerge. This thinking has completely changed to favor a proliferation of specialized models, creating a healthier, less winner-take-all ecosystem where different models serve different needs.

How OpenAI Builds for 800 Million Weekly Users: Model Specialization and Fine-Tuning

a16z Podcast·3 months ago

Use Creative Generative AI for Design, But Deploy Predictable AI for Runtime Execution to Avoid Cost and Risk

Pega's CTO advises using the powerful reasoning of LLMs to design processes and marketing offers. However, at runtime, switch to faster, cheaper, and more consistent predictive models. This avoids the unpredictability, cost, and risk of calling expensive LLMs for every live customer interaction.

#763: Pega CTO Don Schuerman on how AI can pay down tech debt and accelerate digital transformation

The Agile Brand with Greg Kihlström®: Expert Mode Marketing Technology, AI, & CX·3 months ago

The Future of AI Development Is Using a Portfolio of Specialized Agents

Instead of relying on a single, all-purpose coding agent, the most effective workflow uses different agents for their specific strengths. For example, use the 'Friday' agent for UI tasks, 'Charlie' for code reviews, and 'Claude Code' for research and backend logic.

Best of the Pod: Claude Code - How Two Engineers Ship Like a Team of 15

AI & I·3 months ago

AI Companies Should Create Branded 'Composite Models' to Improve Performance and Decouple from Labs

Instead of offering a model selector, creating a proprietary, branded model allows a company to chain different specialized models for various sub-tasks (e.g., search, generation). This not only improves overall performance but also provides business independence from the pricing and launch cycles of a single frontier model lab.

⚡ Inside GitHub’s AI Revolution: Jared Palmer Reveals Agent HQ & The Future of Coding Agents

Latent Space: The AI Engineer Podcast·3 months ago
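
A branded composite model of the kind described above can be sketched as a single callable that hides a chain of specialized steps. Everything in this example is hypothetical, including the step functions and the model name; a real product would call actual search and generation models at each step.

```python
from typing import Callable, List

# Each step takes the previous step's text output and returns new text.
Step = Callable[[str], str]


def search_step(query: str) -> str:
    """Hypothetical stand-in for a specialized search/retrieval model."""
    return f"context for: {query}"


def generation_step(context: str) -> str:
    """Hypothetical stand-in for a specialized generation model."""
    return f"answer based on ({context})"


class CompositeModel:
    """Presents a chain of sub-models to the user as one named model."""

    def __init__(self, name: str, steps: List[Step]) -> None:
        self.name = name
        self._steps = list(steps)

    def __call__(self, prompt: str) -> str:
        result = prompt
        for step in self._steps:
            # Feed each step's output into the next one.
            result = step(result)
        return result


# The user only ever sees the brand name, not the models behind it.
house_model = CompositeModel("house-model-v1", [search_step, generation_step])
```

Because callers depend only on `house_model`, any step can be replaced with a model from a different lab without changing the product surface.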