For creative AI tools, quantitative benchmarks are insufficient. Descript relies on 'vibes' and the curated aesthetic judgment of trusted tastemakers to evaluate and select the best generative models, echoing Midjourney's strategy of having a 'thumb on the scale'.

Related Insights

When every company has access to the same powerful AI tools, the competitive advantage is no longer budget or technology. The real differentiator becomes human taste, judgment, and the ability to apply a unique point of view to guide the AI, separating average, generic output from exceptional work.

The goal of testing multiple AI models isn't to crown a universal winner, but to build your own subjective "rule of thumb" for which model works best for the specific tasks you frequently perform. This personal map of model strengths is more valuable than any generic benchmark.

Creativity is simply remixing existing concepts, a task at which AI excels. Its current primary limitation is in selection. AI can generate a thousand options but doesn't know which one will best appeal to human taste. Making that choice requires a uniquely human ability to balance novelty and familiarity.

As AI democratizes the technical aspects of content creation, the ability to guide it with unique perspective, craft, and taste becomes the key differentiator. AI is a powerful tool for experts to scale their vision, but it cannot replace the vision itself.

Startups like ElevenLabs and Midjourney compete with large AI labs by imbuing their models with a founder's specific 'taste.' This unique aesthetic, from voice texture to image style, creates a product identity that is difficult for a general, large-scale model to replicate.

To codify a specific person's "taste" in writing, the team fed the DSPy framework a dataset of tweets with thumbs up/down ratings and explanations. DSPy then optimized a prompt that created an AI "judge" capable of evaluating new content with 76.5% accuracy against that person's preferences.
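As a rough illustration of what "accuracy against that person's preferences" means here, the sketch below scores a judge by how often its verdict matches the human thumbs up/down rating. It does not use DSPy: `RatedTweet`, `judge_accuracy`, and `toy_judge` are hypothetical names, the four rated examples are made up for illustration, and the keyword rule merely stands in for the optimized prompt described above.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class RatedTweet:
    text: str
    liked: bool        # the person's thumbs up (True) or thumbs down (False)
    explanation: str   # their stated reason for the rating


def judge_accuracy(judge: Callable[[str], bool], examples: list[RatedTweet]) -> float:
    """Fraction of examples where the judge's verdict matches the human rating."""
    if not examples:
        raise ValueError("need at least one rated example")
    hits = sum(judge(ex.text) == ex.liked for ex in examples)
    return hits / len(examples)


def toy_judge(text: str) -> bool:
    # Hypothetical stand-in for an LLM judge: approve anything without "!".
    return "!" not in text


# Made-up ratings, for illustration only.
EXAMPLES = [
    RatedTweet("Shipping beats perfection.", True, "short and concrete"),
    RatedTweet("HUGE news!!! Click now!", False, "reads like an ad"),
    RatedTweet("We cut build time from 9 min to 2.", True, "specific numbers"),
    RatedTweet("Finally launched!", True, "genuine excitement"),
]

print(f"agreement: {judge_accuracy(toy_judge, EXAMPLES):.1%}")
```

The judge agrees with three of the four ratings here, so the sketch reports 75.0% agreement. A figure like the 76.5% quoted above would be computed the same way, on examples held out from whatever data was used to optimize the prompt.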

The best AI models are trained on data that reflects deep, subjective qualities, not just simple criteria. This "taste" is a key differentiator, influencing everything from code generation to creative writing, and is shaped by the values of the frontier lab that trains the model.

AI tools enable "vibe coding," where you describe a desired outcome or feeling (e.g., "make the crowd go wild") rather than technical specifications. This decouples taste (what you want) from skill (how to make it), opening creative fields to non-experts.

AI tools can drastically increase the volume of initial creative explorations, moving from 3 directions to 10 or more. The designer's role then shifts from pure creation to expert curation, using their taste to edit AI outputs into winning concepts.

Anthropic's Claude is gaining traction not just because of its technical benchmarks, but because users perceive it as having a "soul" and feeling "artisan." This indicates that for consumer AI, subjective qualities like personality, craft, and a non-robotic feel are becoming critical competitive advantages over pure utility.