Founder Josh Pigford's "But For Real" Skill Bullies AI into Finding Its Own Bugs

Related Insights

Anthropic Finds AI Skills for Verifying Code Deliver Higher ROI Than Generation Skills

Anthropic's Claude Code team reports that AI agent skills designed for "verification"—teaching an agent to test and validate its own output—provide an extremely high return on investment. This suggests that building reliability and correctness into AI workflows is as critical, if not more so, than the initial generation capability.

How to Use Agent Skills

The AI Daily Brief: Artificial Intelligence News and Analysis·4 months ago

Use A Rival AI Model (GPT) to Adversarially Review Your Primary Model's (Opus) Code

Instead of relying on a single AI model, Josh Pigford's workflow uses Opus for initial code generation and then runs a review pass with a different powerful model like GPT. This adversarial, multi-model process consistently uncovers 3-5 bugs that the primary model overlooks.

The Exact AI Skills This Solo Founder Uses to Build 5 Apps at Once | Josh Pigford

Behind the Craft·2 months ago

Pit Competing LLMs (Claude, Codex, Gemini) Against Each Other for Robust Code Reviews

To overcome the challenge of reviewing AI-generated code, have different LLMs like Claude and Codex review the code. Then, use a "peer review" prompt that forces the primary LLM to defend its choices or fix the issues raised by its "peers." This adversarial process catches more bugs and improves overall code quality.

The non-technical PM’s guide to building with Cursor | Zevi Arnovitz (Meta)

Lenny's Podcast: Product | Career | Growth·6 months ago

Program Your AI to "Grill You" and Expose Your Personal Biases

Move beyond using AI as an assistant and program it to be a critical sparring partner. Pendo's Field CPO had his AI analyze his codebase and brutally call him out for building a system for himself, not for others, forcing a strategic realignment.

This CPO Uses Claude Code to Run his Entire Work Life | Dave Killeen, Field CPO @ Pendo

The Growth Podcast·4 months ago

Improve AI Accuracy by Pitting "Opponent" Sub-Agents Against Each Other

To improve the quality and accuracy of an AI agent's output, spawn multiple sub-agents with competing or adversarial roles. For example, a code review agent finds bugs, while several "auditor" agents check for false positives, resulting in a more reliable final analysis.

Inside Claude Code From the Engineers Who Built It

AI & I·9 months ago

Prompting an AI to Critique Its Own Work as an Expert Persona Improves Accuracy

An effective method for refining AI output is to instruct the model to adopt an expert persona, such as a "PhD economist," and critically evaluate its own work. This often leads the model to self-identify and correct its own flaws without further prompting.

Inside AI with Anthropic's Peter McCrory

Moody's Talks - Inside Economics·3 months ago

Create a "Learnings" Skill to Make Your AI Agent Self-Correct From Past Mistakes

Pigford built a meta-skill that reviews each development session, including conversations where he repeatedly corrected the AI. It then distills these corrections into a central project document, effectively teaching the AI agent not to make the same mistakes in future sessions.

The Exact AI Skills This Solo Founder Uses to Build 5 Apps at Once | Josh Pigford

Behind the Craft·2 months ago

Use AI to Adversarially Review Software Specs to Expose Flaws Before Coding Begins

A powerful technique for creating robust software plans is to use AI as an adversarial partner. After drafting a specification, prompt an AI to "tear it apart" by identifying underspecified or inconsistent points. Iterate on this process until the AI's feedback becomes niche, indicating a solid spec.

970: The “100x Engineer”: How to Be One, But Should You?

Super Data Science: ML & AI Podcast with Jon Krohn·5 months ago

A 'Gotcha' Section Detailing Common AI Failures Is the Most Critical Part of a Skill

The most valuable part of an AI agent skill is a 'gotcha' section. This is where you explicitly instruct the model on its typical failure patterns and wrong assumptions for a given task, preventing common errors before they happen.

Agent Skills Masterclass

The AI Daily Brief: Artificial Intelligence News and Analysis·4 months ago

Build Self-Improving AI Skills That Learn From Novel Fixes and Proactively Find Similar Issues

An Intercom AI skill for fixing flaky tests goes beyond a simple script. It updates its own internal checklist when it encounters a new type of fix and then proactively searches the codebase for similar problems, creating a 100x impact.

How Intercom 2x’d their engineering velocity in 9 months with Claude Code | Brian Scanlan

How I AI·3 months ago

Get your free personalized podcast brief

Related Insights