The new multi-agent architecture in Opus 4.6, while powerful, dramatically increases token consumption. Each agent runs its own process, so a single prompt can burn several agents' worth of tokens. That is a savvy business strategy: the model's most advanced feature is also its most lucrative for Anthropic.

Related Insights

A key new feature in the Opus 4.6 API is "Adaptive Thinking," which lets developers specify the level of effort the model applies to a task. Setting the effort to 'max' forces the model to think without constraints on depth, a powerful but resource-intensive option exclusive to the new version.
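
A minimal sketch of what this might look like with Anthropic's Python SDK; the model id, the adaptive-thinking block, and the "effort" field are assumptions inferred from the description above, not confirmed parameter names.

```python
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

# Hypothetical: the "claude-opus-4-6" model id and the adaptive "effort"
# control are assumptions based on the description above, not confirmed
# API parameters.
response = client.messages.create(
    model="claude-opus-4-6",
    max_tokens=4096,
    thinking={"type": "adaptive", "effort": "max"},  # hypothetical adaptive-thinking config
    messages=[{"role": "user", "content": "Plan a migration from REST to gRPC."}],
)
print(response.content)
```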

It's counterintuitive, but using a more expensive, more capable model like Opus 4.5 can be cheaper than using smaller models. Because the smarter model works more efficiently and needs fewer interactions to solve a problem, it uses fewer tokens overall, offsetting its higher per-token price.
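
The arithmetic behind this claim, using made-up prices and token counts purely for illustration:

```python
# Illustrative only: prices and token counts are made-up assumptions.
def task_cost(price_per_mtok: float, tokens_per_attempt: int, attempts: int) -> float:
    """Total cost of solving one task, in USD."""
    return price_per_mtok * tokens_per_attempt * attempts / 1_000_000

# Smaller, cheaper model: needs several retries and corrections.
small = task_cost(price_per_mtok=3.00, tokens_per_attempt=40_000, attempts=5)
# Larger, pricier model: solves it in one focused pass.
large = task_cost(price_per_mtok=15.00, tokens_per_attempt=25_000, attempts=1)

print(f"small model: ${small:.2f}, large model: ${large:.2f}")
# small model: $0.60, large model: $0.38 -> the "expensive" model wins.
```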

Progress in complex, long-running agentic tasks is better measured in tokens consumed than in raw wall-clock time. Improving token efficiency, as in the step from GPT-5 to GPT-5.1, directly enables more tool calls and actions within a feasible operational budget, unlocking greater capability.
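
One way to operationalize this is to budget agentic work in tokens rather than minutes. A sketch, with assumed per-call figures:

```python
# Sketch: budget agentic work in tokens, not minutes.
# The figures below are assumptions for illustration.
TOKEN_BUDGET = 2_000_000        # hypothetical budget for one long-running task
TOKENS_PER_TOOL_CALL = 8_000    # assumed average: prompt + tool output + reasoning

max_tool_calls = TOKEN_BUDGET // TOKENS_PER_TOOL_CALL
print(f"budget supports ~{max_tool_calls} tool calls")

# A model that is 30% more token-efficient per call stretches the same budget:
efficient_calls = TOKEN_BUDGET // int(TOKENS_PER_TOOL_CALL * 0.7)
print(f"a 30% more efficient model gets ~{efficient_calls} calls")
```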

Many developers are failing to access key new features like "Agent Teams" in Anthropic's Opus 4.6, and the issue is often a simple configuration oversight: experimental features must be enabled manually in your settings.json file, and your packages must be up to date, before the model's full capabilities are available.
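
A sketch of what the settings.json entry might look like; the key names are hypothetical, since the exact flag names vary by client version, so check your release notes for the real spelling:

```json
{
  "experimental": {
    "agentTeams": true
  }
}
```

Pair the flag with an updated client package; an older client may simply ignore keys it does not recognize.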

A paradox exists where the cost for a fixed level of AI capability (e.g., GPT-4 level) has dropped 100-1000x. However, overall enterprise spend is increasing because applications now use frontier models with massive contexts and multi-step agentic workflows, creating huge multipliers on token usage that drive up total costs.
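
Plugging illustrative numbers into the paradox shows how quickly the multipliers swamp the per-token price drop; every figure below is assumed purely for the arithmetic:

```python
# All numbers are illustrative assumptions, chosen only to show the shape
# of the paradox: price per token falls, total spend still rises.
price_drop = 1 / 300          # assume per-token price fell ~300x for fixed capability
context_growth = 50           # assume contexts grew from ~4k to ~200k tokens
agentic_steps = 20            # assume one user request now fans out to ~20 model calls

relative_spend = price_drop * context_growth * agentic_steps
print(f"relative spend vs. the GPT-4 era: {relative_spend:.1f}x")
# ~3.3x: cheaper tokens, but the usage multipliers more than eat the savings.
```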

Effective prompting requires adapting your language to the AI's core design. For Anthropic's agent-based Opus 4.6, the optimal prompt is to "create an agent team" with defined roles. For OpenAI's monolithic Codex 5.3, the equivalent prompt is to instruct it to "think deeply" about those same roles itself.
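
Side by side, with a hypothetical task and role names, the difference looks like this:

```python
# Hypothetical prompts illustrating the same task phrased for each model's design.
task = "Build a REST API with auth, billing, and tests."

# Opus 4.6: lean into the multi-agent architecture with explicit roles.
opus_prompt = (
    f"Create an agent team for this task: {task} "
    "Roles: an architect to design the API, a developer to implement it, "
    "and a reviewer to write and run the tests."
)

# Codex 5.3: one monolithic model, so push the role-taking inward.
codex_prompt = (
    f"Think deeply about this task: {task} "
    "First reason as an architect, then as a developer, then as a reviewer, "
    "before writing any code."
)
```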

In a head-to-head test to build a Polymarket clone, Anthropic's Opus 4.6 produced a visually polished, feature-rich app. OpenAI's Codex 5.3 was faster but delivered a basic MVP that required multiple design revisions. The multi-agent "research first" approach of Opus resulted in a superior initial product.

Agent Skills load a skill's full instructions only after user confirmation. This multi-phase flow avoids bloating the context window with unused tools, saving token costs and improving performance compared to a single large system prompt.
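
A minimal sketch of that multi-phase flow; the class and function names are invented for illustration and are not the actual Agent Skills interface:

```python
from dataclasses import dataclass

# Invented names for illustration; not the actual Agent Skills interface.
@dataclass
class Skill:
    name: str
    summary: str              # short line that always sits in context
    instructions_path: str    # full instructions stay on disk until needed

def build_system_prompt(skills: list[Skill]) -> str:
    # Phase 1: only one-line summaries enter the context window.
    return "Available skills:\n" + "\n".join(
        f"- {s.name}: {s.summary}" for s in skills
    )

def load_skill(skill: Skill, user_confirmed: bool) -> str | None:
    # Phase 2: full instructions are read only after the user confirms,
    # so unused skills never consume context tokens.
    if not user_confirmed:
        return None
    with open(skill.instructions_path) as f:
        return f.read()
```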