How I AI
Claude Opus 4.8 is here. Is it as good as they say?

How I AI · May 28, 2026

First look at Anthropic's Claude Opus 4.8: Impressive on greenfield coding but struggles with edge cases, strategy, and factual grounding.

Anthropic's Opus 4.8 Excels at Initial Tasks but Fails on the Final 10% of Details

The model performs impressively on one-shot, greenfield projects but struggles with the critical final details and edge cases. When pushed to refine or iterate on a task, it begins to introduce bugs and loses consistency, revealing a significant weakness in handling sustained complexity.

Opus 4.8 Misses the "Forest for the Trees" by Over-Indexing on Small Data Points

The model has "narrow vision," latching onto specific data or code points and treating them as definitive truth without broader context. This leads to flawed conclusions in both strategic analysis and coding, as it fails to contextualize information or zoom out to see the bigger picture.

Anthropic's Opus 4.8 Reintroduces Confident Hallucinations When Bug Hunting

Despite advancements, the model exhibits a surprising tendency to hallucinate. When investigating bugs or validating information, it confidently presents hypotheses as facts without grounding them in data. This is a significant reliability issue, especially for a model marketed as "more honest."

Opus 4.8 Lacks Ambition for Complex, Agentic Coding Tasks

Despite its capabilities, the model produces uninspired and safe outputs when prompted for ambitious, "state-of-the-art" agentic coding projects. It delivers serviceable code but fails to push creative boundaries or think expansively, falling short of its "10x agentic coding" potential.

Anthropic's Opus 4.7 Outperforms the Newer 4.8 Model on Business Strategy Tasks

In a direct comparison, the older Opus 4.7 model proved superior for business strategy. It produced structured, data-anchored analysis, whereas Opus 4.8 was "handwavy," struggled to find relevant data, and over-rotated on minor data points, leading to weaker strategic recommendations.
