Curated RSS Brief
Claude’s new model is more ‘honest’ when it messes up
AI Close AI Posts from this topic will be added to your daily email digest and your homepage feed. Follow Follow See All AI News Close News Posts from this topic will be added to your daily email digest and your homepage feed. Follow Follow See All News Anthropic Close Anthropic Posts from this topic will be added to your daily email digest and your homepage feed. Follow Follow See All Anthropic Claude’s new model is more ‘honest’ when it messes up Opus 4.8 is launching on Thursday. Opus 4.8 is launching on Thursday. by Jay Peters Close Jay Peters Senior Reporter Posts from this author will be added to your daily email digest and your homepage feed. Follow Follow See All by Jay Peters May 28, 2026, 5:00 PM UTC Link Share Gift Image: Cath Virginia / The Verge, Getty Images Jay Peters Close Jay Peters Posts from this author will be added to your daily email digest and your homepage feed. Follow Follow See All by Jay Peters is a senior reporter covering technology, gaming, and more. He joined The Verge in 2019 after nearly two years at Techmeme. Anthropic is releasing Claude Opus 4.8 on Thursday, and the company is touting the model’s “honesty.” According to Anthropic , it trains “all [its] models to be honest — for instance, to avoid making claims that they can’t support.” But it notes that “a general problem with AI models is that they sometimes jump to conclusions, confidently presenting their work as making progress despite thin evidence.” The AI lab claims that early testers have found that Opus 4.8 “is more likely to flag uncertainties about its work and less likely to make unsupported claims.” In the company’s evaluations, Opus 4.8 is “around 4x less likely than its predecessor to allow flaws in code it’s written to pass unremarked.” In addition to the honesty improvements, with Opus 4.8, users can direct the amount of effort Claude puts into a task. Higher-effort responses will use more tokens, giving users the option of lower-effort responses if they don’t want to burn through their rate limits as quickly. Anthropic is also launching a feature called “dynamic workflows” in research preview, which the company says will let Claude “take on even bigger tasks.” With dynamic workflows, “Claude can plan the work and then run hundreds of parallel subagents in a single session (and with Opus 4.8, the agents can run for even longer). It then verifies its outputs before reporting back to the user.” Follow topics and authors from this story to see more like this in your personalized homepage feed and to receive email updates. Jay Peters Close Jay Peters Senior Reporter Posts from this author will be added to your daily email digest and your homepage feed. Follow Follow See All by Jay Peters AI Close AI Posts from this topic will be added to your daily email digest and your homepage feed. Follow Follow See All AI Anthropic Close Anthropic Posts from this topic will be added to your daily email digest and your homepage feed. Follow Follow See All Anthropic News Close News Posts from this topic will be added to your daily email digest and your homepage feed. Follow Follow See All News Most Popular Most Popular Kia’s flagship EV has a battery problem The golden age of handheld gaming is already over Valve raises Steam Deck prices by more than $200 They’ve finally made the Oura Ring smaller and lighter What’s next for Microsoft’s Surface PCs? The Verge Daily A free daily digest of the news that matters most. Email (required) Sign Up By submitting your email, you agree to our Terms and Privacy Notice . This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply. Advertiser Content From This is the title for the native ad
- AI Close AI Posts from this topic will be added to your daily email digest and your homepage feed.
- Follow Follow See All AI News Close News Posts from this topic will be added to your daily email digest and your homepage feed.
- Follow Follow See All News Anthropic Close Anthropic Posts from this topic will be added to your daily email digest and your homepage feed.
- Follow Follow See All Anthropic Claude’s new model is more ‘honest’ when it messes up Opus 4.8 is launching on Thursday.
If you want the exact wording, examples, or full context from the publisher, open the original source article.
Open Original Article
Comments
Post a Comment