On Thursday, Anthropic launched Claude Opus 4.8, the latest and most advanced version of its flagship AI model. It’s ...
The new Claude Opus 4.8 is a "modest but tangible improvement," but a Mythos model you can use may be just weeks away.
Opus 4.8 shows a growing tendency to reason explicitly about how its outputs will be graded, including in environments where it wasn't told it was being evaluated.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results