Skip to main content

GPT-4o and Gemini 1.5 Pro just got beat in the AI race

a screenshot of claude 3.5 sonnet, with an 8-bit crab
Anthropic

There’s a new leader, technically, in the race for AI assistant dominance, and it’s Anthropic’s new Claude 3.5 Sonnet. The newly released model outperforms both Gemini 1.5 Pro and ChatGPT-4o across a spectrum of benchmark tests, the company announced on Thursday.

This new iteration of Sonnet is the first in Anthropic’s upcoming line of 3.5 models, and it significantly outperforms the more expansive Opus 3.0 model, and does so at a fraction of the larger model’s energy cost. Compute efficiency is becoming an increasingly important aspect of AI system design, especially as the cost of both powering and cooling AI data centers soars while the infrastructure pushes into the gigawatt range.

Claude 3.5 Sonnet for vision

“Claude 3.5 Sonnet operates at twice the speed of Claude 3 Opus,” the Anthropic team wrote in a blog post. “This performance boost, combined with cost-effective pricing, makes Claude 3.5 Sonnet ideal for complex tasks such as context-sensitive customer support and orchestrating multistep workflows.”

The new model has reportedly set benchmark results across three standardized tests: graduate-level reasoning with GPQA, undergraduate-level knowledge with MMLU, and coding proficiency with HumanEval. It beat out Google’s Gemini 1.5 Pro, Meta’s Llama-400b, and OpenAI’s ChatGPT-4o, though not by any huge margin and typically only by a couple percentage points.

A table showing Claude 3.5 Sonnet's performance compared to other leading AI systems.
Anthropic

Sonnet 3.5 is being billed as Anthropic’s “strongest vision model yet. ” It’s capable of performing a number of vision-based tasks — like interpreting charts and graphs or transcribing text from imperfect image sources like screenshots or scanned receipts — more accurately than Opus 3.0. In fact, Sonnet 3.5 beat out Opus 3.0 by anywhere from 6 to 17 points across industry standard vision benchmarks. The new model is also reportedly much more competent at handling humor and can converse in a much more lifelike manner.

Sonnet will also be the first Anthropic AI to offer the Artifacts feature to users. Rather than generate images or code snippets directly into the flow of the conversation, Artifacts will create that content in a dedicated space to the side of the chat. This allows users to create “a dynamic workspace where they can see, edit, and build upon Claude’s creations in real time, seamlessly integrating AI-generated content into their projects and workflows,” the Anthropic team claims. It also announced that Claude will soon support team collaboration wherein a company can store its data, documents and projects in a single, central silo, with Claude acting as an on-demand assistant.

You can try out Claude 3.5 Sonnet today for free on the Claude.ai website and the Claude iOS app (a Claude Pro or Team subscription will garner you significantly higher rate limits). Third-party integration is also available through the Anthropic API, Amazon Bedrock, and Google Cloud’s Vertex AI. Claude Haiku 3.5 and Opus 3.5 are scheduled for release later in the year.

Andrew Tarantola
Andrew has spent more than a decade reporting on emerging technologies ranging from robotics and machine learning to space…
Best TP-Link Prime Day deals: routers and cameras on sale
The TP-Link Archer AXE7800 tri-band Wi-Fi 6E router in a room.

TP-Link is a trusted brand for routers and other smart home devices, and even though many Prime Day deals have officially ended, some TP-Link Prime Day deals are still available to claim. Whether you're looking for Prime Day router deals, smart light bulb deals, or Prime Day security camera deals, you should check out these remaining TP-Link Prime Day deals. Feel free to make a purchase quickly, as a deal could end at any moment and inventories are likely about out at this point.
Best TP-Link router Prime Day deals

If you're not satisfied with your current router's Wi-Fi coverage, you should take a look at the TP-Link router Prime Day deals that we've rounded up below. Whether you're thinking of upgrading to a better-performing router for your online games, or you want to set up a mesh system so that every corner of your home can get fast Wi-Fi, there's something for you from these offers, though we're not sure how much time is remaining before they expire.

Read more
Dell saved its best gaming laptop deal for after Prime Day
The Dell G16 gaming laptop.

Prime Day deals have already ended, but Dell keeps rolling out gaming laptop deals that catch our attention. Here's an offer that's going to be hard to refuse -- the Dell G16 gaming laptop with the Nvidia GeForce RTX 4060 graphics card for only $1,000, following a $500 discount on the device's original price of $1,500. We're not sure how much time is left before you miss the chance to take advantage of this bargain though, so we highly recommend proceeding with the purchase as soon as possible.

Why you should buy the Dell G16 gaming laptop
The Dell G16 gaming laptop is a fantastic choice for gamers who are sticking to a tight budget, because it will let you play the best PC games without breaking the bank. In addition to the Nvidia GeForce RTX 4060 graphics card, the device is powered by the 13th-generation Intel Core i9 processor and 16GB of RAM, which is the preferred minimum amount according to our gaming laptop buying guide. The Dell G16 also ships with a 1TB SSD, which is a lot of storage space for your video games, and with Windows 11 Home pre-loaded, you can start installing your favorite titles right after you unbox the gaming laptop.

Read more