/
© 2026 RiffOn. All rights reserved.
  1. The AI Daily Brief: Artificial Intelligence News and Analysis
  2. How People Actually Use AI Agents
How People Actually Use AI Agents

How People Actually Use AI Agents

The AI Daily Brief: Artificial Intelligence News and Analysis · Feb 19, 2026

Anthropic's new study shows real-world AI agent use is defined by human interaction and trust, not just raw model capability.

AI Model Benchmarks Are Increasingly Unreliable Due to Widespread "Training to the Test"

The gap between benchmark scores and real-world performance suggests labs achieve high scores by distilling superior models or training for specific evals. This makes benchmarks a poor proxy for genuine capability, a skepticism that should be applied to all new model releases.

How People Actually Use AI Agents thumbnail

How People Actually Use AI Agents

The AI Daily Brief: Artificial Intelligence News and Analysis·2 days ago

Google's Lyria AI Music Tool Targets Social Expression, Not Professional Creation

Google is positioning its Lyria music generator not as a competitor to sophisticated tools like Suno, but as a fun, social feature for quick, expressive clips in Gemini and YouTube Shorts. This focuses on a more casual use case, sidestepping direct comparison on musical complexity.

How People Actually Use AI Agents thumbnail

How People Actually Use AI Agents

The AI Daily Brief: Artificial Intelligence News and Analysis·2 days ago

Anthropic's "Walled Garden" Policy Backlash Highlights an Existing Industry Norm

User outrage over Anthropic restricting personal account usage for third-party tools missed that competitors like Google and OpenAI already had similar policies. This shows Anthropic was aligning with an established trend towards closed ecosystems, not pioneering an unpopular one.

How People Actually Use AI Agents thumbnail

How People Actually Use AI Agents

The AI Daily Brief: Artificial Intelligence News and Analysis·2 days ago

Anthropic's AI Agent Seeks Clarification More Often Than Its Human Users Intervene

On complex tasks, the Claude agent asks for clarification more than twice as often as humans interrupt it. This challenges the narrative of needing to constantly correct an overconfident AI; instead, the model self-regulates by identifying ambiguity to ensure alignment before proceeding.

How People Actually Use AI Agents thumbnail

How People Actually Use AI Agents

The AI Daily Brief: Artificial Intelligence News and Analysis·2 days ago

Anthropic's Data Shows Half of Agent Tool Calls Are Already Non-Engineering Tasks

Even within the code-centric Claude Code environment, nearly 50% of agentic tasks are for business functions like back-office automation, sales, and marketing. This is a strong leading indicator that agentic AI is rapidly expanding beyond its initial software development niche.

How People Actually Use AI Agents thumbnail

How People Actually Use AI Agents

The AI Daily Brief: Artificial Intelligence News and Analysis·2 days ago

Expert AI Agent Users Grant More Autonomy While Simultaneously Intervening More Often

Anthropic's study reveals a paradox: expert users grant AI agents more freedom via auto-approval while also interrupting them more frequently. This suggests mastery involves active, targeted supervision to guide the agent, not a passive "set it and forget it" approach.

How People Actually Use AI Agents thumbnail

How People Actually Use AI Agents

The AI Daily Brief: Artificial Intelligence News and Analysis·2 days ago