OpenAI's Deep Research Uses a Hybrid "Agentic Workflow" to Mitigate Risk Before Execution

Related Insights

Build Reliable AI Agents by Gradually Increasing Autonomy, Not Launching Fully Autonomous

To avoid failure, launch AI agents with high human control and low agency, such as suggesting actions to an operator. As the agent proves reliable and you collect performance data, you can gradually increase its autonomy. This phased approach minimizes risk and builds user trust.

What OpenAI and Google engineers learned deploying 50+ AI products in production

Lenny's Podcast: Product | Career | Growth·a month ago

Real-World AI Agents Require Deterministic Workflows, Not Full Autonomy

Contrary to the vision of free-wheeling autonomous agents, most business automation relies on strict Standard Operating Procedures (SOPs). Products like OpenAI's Agent Builder succeed by providing deterministic, node-based workflows that enforce business logic, which is more valuable than pure autonomy.

How OpenAI Builds for 800 Million Weekly Users: Model Specialization and Fine-Tuning

a16z Podcast·3 months ago

The "Year of the Agent" is a Decade-Long Journey; Use "Agentic Workflows" Today

Fully autonomous agents are not yet reliable for complex production use cases because accuracy collapses when chaining multiple probabilistic steps. Zapier's CEO recommends a hybrid "agentic workflow" approach: embed a single, decisive agent within an otherwise deterministic, structured workflow to ensure reliability while still leveraging LLM intelligence.

INSIDE How AI Startups hire, AI Roundtable with Wade Foster, Mikey Schulman, and Ali Ansari | E2225

This Week in Startups·2 months ago

Differentiate AI Systems by Agency: Workflows are Deterministic, Agents Choose Their Own Tools

An AI agent uses an LLM with tools, giving it agency to decide its next action. In contrast, a workflow is a predefined, deterministic path where the LLM's actions are forced. Most production AI systems are actually workflows, not true agents.

959: Building Agents 101: Design Patterns, Evals and Optimization (with Sinan Ozdemir)

Super Data Science: ML & AI Podcast with Jon Krohn·a month ago

Evaluate Each Step in an Agentic Workflow, Not Just the Final Output

Treating AI evaluation like a final exam is a mistake. For critical enterprise systems, evaluations should be embedded at every step of an agent's workflow (e.g., after planning, before action). This is akin to unit testing in classic software development and is essential for building trustworthy, production-ready agents.

AI Agents for PMs in 69 Minutes — Masterclass with IBM VP

Product Growth Podcast·5 months ago

Tasklet Bets on Open-Ended Agents Over Rigid Workflows for Greater Reliability

Tasklet's CEO argues that while traditional workflow automation seems safer, agentic systems that let the model plan and execute will ultimately prove more robust. They can handle unexpected errors and nuance that break rigid, pre-defined workflows, a bet on future model improvements.

Always Bet on the Models: How Tasklet Puts the Agency in Agents, with CEO Andrew Lee

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·4 months ago

De-Risk Enterprise AI Rollouts by First Assisting Human Agents Before Customer-Facing Deployment

To mitigate risks like AI hallucinations and high operational costs, enterprises should first deploy new AI tools internally to support human agents. This "agent-assist" model allows for monitoring, testing, and refinement in a controlled environment before exposing the technology directly to customers.

#785: Avaya CTO David Funck on building persistent memory of the customer with AI

The Agile Brand with Greg Kihlström®: Expert Mode Marketing Technology, AI, & CX·2 months ago

Agentic AI's Key Barrier is the Gap Between 'Knowing' and 'Doing'

While AI models excel at gathering and synthesizing information ('knowing'), they are not yet reliable at executing actions in the real world ('doing'). True agentic systems require bridging this gap by adding crucial layers of validation and human intervention to ensure tasks are performed correctly and safely.

44: How AI Agents Could Change the Way You Shop Forever (with Grace Wu)

AI Product Leader·5 months ago

Enterprise AI Agents Require "Semi-Determinism" to Mitigate Production Risks

Fully autonomous AI agents are not yet viable in enterprises. Alloy Automation builds "semi-deterministic" agents that combine AI's reasoning with deterministic workflows, escalating to a human when confidence is low to ensure safety and compliance.

Stop ghosting your friends with Nox’s RPLY, plus Alloy Automation and a Shopify flashback | E2209

This Week in Startups·3 months ago

A "Research, Plan, Implement" Workflow Unlocks AI in Complex Codebases

To get AI agents to perform complex tasks in existing code, a three-stage workflow is key. First, have the agent research and objectively document how the codebase works. Second, use that research to create a step-by-step implementation plan. Finally, execute the plan. This structured approach prevents the agent from wasting context on discovery during implementation.

From Chaos to Code: HumanLayer’s Playbook for Agent-Driven Dev

The Lobster Talks Podcast by Lobster Capital·5 months ago