Contrary to the vision of free-wheeling autonomous agents, most business automation relies on strict Standard Operating Procedures (SOPs). Products like OpenAI's Agent Builder succeed by providing deterministic, node-based workflows that enforce business logic, which is more valuable than pure autonomy.

Related Insights

Don't expect an AI agent to invent a successful sales process. First, have your human team identify and document what works—effective emails, scripts, and objection handling. Then, train the AI on this proven playbook to execute it flawlessly and at scale. The AI is a scaling tool, not a strategist from day one.

The evolution of 'agentic AI' extends beyond content generation to automating the connective tissue of business operations. Its future value is in initiating workflows that span departments, such as kickstarting creative briefs for marketing, creating product backlogs from feedback, and generating service tickets, streamlining operational handoffs.

True Agentic AI isn't a single, all-powerful bot. It's an orchestrated system of multiple, specialized agents, each performing a single task (e.g., qualifying, booking, analyzing). This 'division of labor,' mirroring software engineering principles, creates a more robust, scalable, and manageable automation pipeline.

Building features like custom commands and sub-agents can look like reliable, deterministic workflows. However, because they are built on non-deterministic LLMs, they fail unpredictably. This misleads users into trusting a fragile abstraction and ultimately results in a poor experience.

Training AI agents to execute multi-step business workflows demands a new data paradigm. Companies create reinforcement learning (RL) environments—mini world models of business processes—where agents learn by attempting tasks, a more advanced method than simple prompt-completion training (SFT/RLHF).

Treating AI evaluation like a final exam is a mistake. For critical enterprise systems, evaluations should be embedded at every step of an agent's workflow (e.g., after planning, before action). This is akin to unit testing in classic software development and is essential for building trustworthy, production-ready agents.

Despite marketing hype, current AI agents are not fully autonomous and cannot replace an entire human job. They excel at executing a sequence of defined tasks to achieve a specific goal, like research, but lack the complex reasoning for broader job functions. True job replacement is likely still years away.

Contrary to the view that useful AI agents are a decade away, Andrew Ng asserts that agentic workflows are already solving complex business problems. He cites examples from his portfolio in tariff compliance and legal document processing that would be impossible without current agentic AI systems.

Vercel's CTO Malte Ubl notes that durable, resumable workflows are not a new invention for AI agents. Instead, they are a fundamental computer science concept that has been implemented ad-hoc in every transactional system, from banking in the 70s to modern tech giants, just without a standardized abstraction.

Instead of focusing on foundational models, software engineers should target the creation of AI "agents." These are automated workflows designed to handle specific, repetitive business chores within departments like customer support, sales, or HR. This is where companies see immediate value and are willing to invest.