An OpenAI PM Used a Self-Built Codex App to Check His Accountant's Tax Filing and Found a Mistake

Related Insights

Use an LLM as a Final Audit on Your Tax Return to Catch Errors Pros Miss

Even if you use a professional accountant, running your draft tax return through an LLM can serve as a valuable final check. The AI can identify potential errors, inconsistencies, or missed deductions that human experts might overlook, potentially leading to thousands of dollars in savings.

Cash received is not revenue earned

Complex Systems with Patrick McKenzie (patio11)·3 months ago

OpenAI's Codex Tech Lead Writes Only 10% of His Code; AI Generates the Rest

Michael Bolin, a tech lead on OpenAI's Codex, says models now generate 80-90% of his code. He reserves manual coding for critical, low-level tasks like security sandboxing. For most work, including debugging and refactoring, he relies on the AI agent to maximize his throughput.

OpenAI Codex Tech Lead On How His Career Grew And How He Uses Codex | Michael Bolin

The Peterman Pod·4 months ago

Hyper-Niche AI Tools Like a 'K1 Tax Form Finder' Are Untapped Goldmines

Inspired by standalone sites like bankstatementconverter.com, a major opportunity in the ChatGPT store is building apps that solve highly specific, painful business problems. An app that automatically finds all K1 tax forms in a user's Gmail is a prime example of a simple tool with massive value for a specific audience.

5 App Ideas for ChatGPT’s New App Store ft. Greg Isenberg

My First Million·9 months ago

OpenAI PMs Use Codex to Synthesize Scattered Dashboards, Not Just Automate

The key value of Codex for a growth PM at OpenAI wasn't just viewing a single dashboard, but building a unified web app that pulls from multiple scattered sources (Databricks, Tableau). This combines data synthesis with a TLDR summary, overcoming cognitive overload.

How to Use Codex Like an OpenAI PM | Abhi Muchhal, PM OpenAI (ex-Meta and Nubank)

The Growth Podcast·2 months ago

Journalists Can Use LLMs as a Final Fact-Checking Layer to Catch Mistakes

Journalist Casey Newton uses AI tools not to write his columns, but to fact-check them after they're written. He finds that feeding his completed text into an LLM is a surprisingly effective way to catch factual errors, a significant improvement in model capability over the past year.

The Death of the Tech Conference, Jake Paul Joins, Dimon Launches Deregulation Blitz | Jake Paul & Geoffrey Woo, Matt Pavelle, David Senra & Lulu Cheng, Casey Newton, Alex Epstein, Jamie Siminoff

TBPN·6 months ago

Use AI Models Like Claude to Proofread and Verify Data in Long Reports

The podcast team used Claude Code to cross-check every number and chart in a 50+ page report against the source data, as well as proofread the text. This is a powerful use case for AI in tedious verification tasks where human attention wanes and errors can easily slip through.

#214: Musk v. OpenAI Round 2, Coinbase AI Layoffs, AI “Soft Nationalization & xAI Folds Into SpaceX

The Artificial Intelligence Show·2 months ago

AI's Hidden Value is Fact-Checking Human Accountants and Corporate Press Releases for Errors

Instead of solely focusing on AI fallibility, a major application is using AI agents to audit human work. Perplexity's "Final Pass" feature analyzes documents for factual errors and internal inconsistencies, finding glaring mistakes in things like Gartner's earnings press releases and work done by professional accountants.

AI Agents: Mirage Or Real Revolution? — With Dmitry Shevelenko

Big Technology Podcast·2 months ago

OpenAI Merges Codex into ChatGPT Because Its Co-Developed Software "Harness" Is Superior

OpenAI is combining Codex with ChatGPT, recognizing that the software "harness" enabling Codex's actions is more effective for all knowledge work tasks. This success stems from building the model and its action-taking software together in one team, a key lesson for developing capable AI agents.

Microsoft’s Homegrown AI Models, Trump’s AI Executive Order, OpenAI to Merge Codex & ChatGPT

The Information's TITV·2 months ago

An OpenAI Team Built a Million-Line App Without Writing Any Code

An OpenAI team developed an internal application with one million lines of code, all generated by an AI agent. Engineers were forbidden from writing code directly, instead shifting their role to diagnosing AI failures and improving the underlying system to prevent repeat mistakes.

How PMs Ship 100K Lines of Code at OpenAI with Ryan Lopopolo, Member of Technical Staff

The Growth Podcast·2 months ago

An Ex-Google Analyst's $300k Workflow: Turn a CSV into a Leadership Deck in Hours

An ex-Google data analyst demonstrates using OpenAI's Codex to analyze a CSV file of customer data. She prompts the AI to perform a root cause and cohort analysis for a retention drop, then automatically generates a leadership presentation, condensing a multi-day task into a two-hour project.

Google Data Analyst Shares Her $300k/year Codex Workflow

Marketing Against The Grain·2 months ago

Get your free personalized podcast brief

Related Insights