Tag: benchmarks
All the articles with the tag "benchmarks".
-
Claude Sonnet 4.6: Opus-Level Performance at 1/5 the Cost
Anthropic's Sonnet 4.6 closes the gap with Opus on coding and computer use benchmarks while staying at $3/$15M tokens. Here's what the numbers mean for builders.