Get your free personalized podcast brief

We scan new podcasts and send you the top 5 insights daily.

To get golfers and coaches to adopt the "Strokes Gained" metric, the PGA Tour presented them with two anonymized player rankings. Stakeholders consistently chose the ranking generated by the new metric as more accurate, leading to adoption without needing to explain the complex underlying analytics.

Related Insights

To ensure AI labs don't provide specially optimized private endpoints for evaluation, the firm creates anonymous accounts to test the same public models everyone else uses. This "mystery shopper" policy maintains the integrity and independence of their results.

In an analysis of 50 past email campaigns, ChatGPT's 5.2 model correctly identified the winning A/B test variation 89% of the time without performance data. Marketers can use this predictive capability to vet campaign elements like subject lines and creative before launching live tests, potentially saving time and resources.

Conative.ai onboarded skeptical inventory planners by having them compare their manual forecasts against the AI's for 2-4 weeks. This "bake-off" quickly demonstrated the AI's accuracy and immense time savings, effectively converting users who initially trusted their own experience over the technology.

Mark Broadie's "Strokes Gained" analysis revealed that ball striking (driving and approach shots) accounts for two-thirds of the skill difference between top pros and average ones. The long-held belief that putting was the key differentiator was incorrect, showcasing how data can overturn conventional wisdom.

DBS quantifies the ROI of its AI by tracking revenue generated from A/B tested customer "nudges." This practical application, which yielded $750 million, provides a direct feedback loop on whether AI-driven offers are effective, moving beyond simple efficiency metrics to prove top-line growth.

Instead of focusing on grand projects that yielded little return, The Atlantic's subscription growth was driven by a culture of data science and iterative testing. They ran over 230 A/B tests in a single year on their paywall, proving that small, continuous improvements can create massive results.

To set realistic success metrics for new AI tools, Descript used its most popular pre-AI feature, "remove filler words," as the baseline. They compared adoption and retention of new AI features against this known winner, providing a clear, internal benchmark for what "good" looks like instead of guessing at targets.

In the debate between data-driven AB testing and intuitive 'taste' for product design, a humorous but practical career tip emerged: run the AB test to find the optimal solution (e.g., a blue button). Then, instead of presenting the data, confidently tell leadership the choice was based on your superior 'taste,' thereby building a reputation for invaluable intuition.

When pitching a move away from legacy metrics like MQLs, don't just present flaws. Frame the new model as a superior, more predictable growth equation. Executives need a reliable forecasting model, so give them a new 'plug and play' formula to secure their buy-in.

To get Google's TPU team to adopt their AI, the AlphaChip founders overcame deep skepticism through a relentless two-year process of weekly data reviews, proving their AI was superior on every single metric before engineers would risk their careers on the unconventional designs.