Skip to main content

GPT-4o and Gemini 1.5 Pro just got beat in the AI race

a screenshot of claude 3.5 sonnet, with an 8-bit crab
Anthropic

There’s a new leader, technically, in the race for AI assistant dominance, and it’s Anthropic’s new Claude 3.5 Sonnet. The newly released model outperforms both Gemini 1.5 Pro and ChatGPT-4o across a spectrum of benchmark tests, the company announced on Thursday.

This new iteration of Sonnet is the first in Anthropic’s upcoming line of 3.5 models, and it significantly outperforms the more expansive Opus 3.0 model, and does so at a fraction of the larger model’s energy cost. Compute efficiency is becoming an increasingly important aspect of AI system design, especially as the cost of both powering and cooling AI data centers soars while the infrastructure pushes into the gigawatt range.

Claude 3.5 Sonnet for vision

“Claude 3.5 Sonnet operates at twice the speed of Claude 3 Opus,” the Anthropic team wrote in a blog post. “This performance boost, combined with cost-effective pricing, makes Claude 3.5 Sonnet ideal for complex tasks such as context-sensitive customer support and orchestrating multistep workflows.”

Recommended Videos

The new model has reportedly set benchmark results across three standardized tests: graduate-level reasoning with GPQA, undergraduate-level knowledge with MMLU, and coding proficiency with HumanEval. It beat out Google’s Gemini 1.5 Pro, Meta’s Llama-400b, and OpenAI’s ChatGPT-4o, though not by any huge margin and typically only by a couple percentage points.

A table showing Claude 3.5 Sonnet's performance compared to other leading AI systems.
Anthropic

Sonnet 3.5 is being billed as Anthropic’s “strongest vision model yet. ” It’s capable of performing a number of vision-based tasks — like interpreting charts and graphs or transcribing text from imperfect image sources like screenshots or scanned receipts — more accurately than Opus 3.0. In fact, Sonnet 3.5 beat out Opus 3.0 by anywhere from 6 to 17 points across industry standard vision benchmarks. The new model is also reportedly much more competent at handling humor and can converse in a much more lifelike manner.

Sonnet will also be the first Anthropic AI to offer the Artifacts feature to users. Rather than generate images or code snippets directly into the flow of the conversation, Artifacts will create that content in a dedicated space to the side of the chat. This allows users to create “a dynamic workspace where they can see, edit, and build upon Claude’s creations in real time, seamlessly integrating AI-generated content into their projects and workflows,” the Anthropic team claims. It also announced that Claude will soon support team collaboration wherein a company can store its data, documents and projects in a single, central silo, with Claude acting as an on-demand assistant.

You can try out Claude 3.5 Sonnet today for free on the Claude.ai website and the Claude iOS app (a Claude Pro or Team subscription will garner you significantly higher rate limits). Third-party integration is also available through the Anthropic API, Amazon Bedrock, and Google Cloud’s Vertex AI. Claude Haiku 3.5 and Opus 3.5 are scheduled for release later in the year.

Andrew Tarantola
Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…
Presidents’ Day Dell Deals: XPS, G16, monitors and more on sale
The Dell XPS 14 open on a wooden table.

Presidents' Day is a nice three-day reprieve from work, and it's also a nice excuse to do some shopping. And Dell is certainly ready, with business laptops, monitors, and more discounted on their website and across Amazon. We've picked out our favorite deals, largely from the best Dell products out there -- and products we've personally reviewed or have hands-on experiences with. Here, we present that list to you so you can get some of the best laptop deals and monitor deals around. Remember that as these deals are coming out around the Presidents' Day holiday (though not all of them have explicit "Presidents' Day" markings) they very well might end soon, so plan your purchases accordingly.
Dell S2425HS Monitor — $110 $140 21% off

This sleek monitor with a modern look has integrated speakers, a 100Hz refresh rate, and a 4-star TÜV Rheinland eye comfort rating. The 24-inch Dell S2425HS is a great second monitor for your home office or second study. You won't find many monitor deals with a price lower than the starting price of $140 that this one sports, much less the reduced $110.

Read more
1Password vs. NordPass: which password manager is best in 2025?
1Password and NordPass reviews appear beside one another on a PC monitor.

1Password and NordPass are among the most popular and best password managers available. Both offer significant improvements over the built-in solutions you get from Microsoft, Apple, and Google, making it hard to choose between them.

I've reviewed the latest versions of 1Password and NordPass in 2025 and can share some insights into the differences and compare prices to help you discover which offers the best value for you.
Specs

Read more
This iBuyPower gaming PC with RTX 4060 is under $1,000 — for now
The iBUYPOWER Trace 7 Mesh gaming desktop on a white background.

Gaming PC deals worth buying still usually cost more than $1,000 after the discounts, but here's an offer from Best Buy that's available for a more affordable price. The iBuyPower Trace 7 Mesh, which is originally sold for $1,300, is down to just $900 following a $400 discount. We're not sure how much time is remaining before this bargain ends, so if you're interested in this gaming desktop, you need to push forward with your purchase immediately if you want to secure the savings.

Why you should buy the iBuyPower Trace 7 Mesh gaming PC
The iBuyPower Trace 7 Mesh is a relatively affordable gaming PC, but it doesn't sacrifice much in terms of performance. It runs on the AMD Ryzen 7 5700 processor and the Nvidia GeForce RTX 4060, which is in our list of the best graphics cards as our recommendation for 1080p gaming. It has 16GB of RAM, which is the best place to start for a gaming PC, according to our guide on how much RAM do you need. With these components, you won't have trouble playing the best PC games, though you'll have to go with medium settings for the more demanding titles.

Read more