Skip to main content

Your favorite AI chatbot might not be telling the truth

AI chatbots.
PIxabay

AI search tools are becoming more popular, with one in four Americans reporting using AI instead of traditional search engines. However, here’s an important note: these AI chatbots do not always provide accurate information.

A recent study by the Tow Center for Digital Journalism, reported by Columbia Journalism Review, indicates that chatbots struggle to retrieve and cite news content accurately. Even more concerning is their tendency to invent information when they do not have the correct answer.

Recommended Videos

AI chatbots tested for the survey included many of the “best,” including ChatGPT, Perplexity, Perplexity Pro, DeepSeek, Microsoft’s Copilot, Grok-2, Grok-3, and Google Gemini.

In the tests, AI chatbots were given direct excerpts from 10 online articles published by various outlets. Each chatbot received 200 queries, representing 10 articles across 20 different publishers, for 1,600 queries. The chatbots were asked to identify the headline of each article, its original publisher, publication date, and URL.

Similar tests conducted with traditional search engines successfully provided the correct information. However, the AI chatbots did not perform as well.

The findings indicated that chatbots often struggle to decline questions they cannot answer accurately, frequently providing incorrect or speculative responses instead. Premium chatbots tend to deliver confidently incorrect answers more often than their free counterparts. Additionally, many chatbots appeared to disregard the Robot Exclusion Protocol (REP) preferences, which websites use to communicate with web robots like search engine crawlers.

The survey also found that generative search tools were prone to fabricating links and citing syndicated or copied versions of articles. Moreover, content licensing agreements with news sources did not guarantee accurate citations in chatbot responses.

What can you do?

What stands out most about the results of this survey is not just that AI chatbots often provide incorrect information but that they do so with alarming confidence. Instead of admitting they don’t know the answer, they tend to respond with phrases like “it appears,” “it’s possible,” or “might.”

For instance, ChatGPT incorrectly identified 134 articles yet only signaled uncertainty 15 times out of 200 responses and never refrained from providing an answer.

Based on the survey results, it’s probably wise not to rely exclusively on AI chatbots for answers. Instead, a combination of traditional search methods and AI tools is recommended. At the very least, using multiple AI chatbots to find an answer may be beneficial. Otherwise, you risk obtaining incorrect information.

Looking ahead, I wouldn’t be surprised to see a consolidation of AI chatbots as the better ones stand out from the poor-quality ones. Eventually, their results will be as accurate as those from traditional search engines. When that will happen is anyone’s guess.

Bryan M. Wolfe
Former Mobile and A/V Freelancer
Bryan M. Wolfe has over a decade of experience as a technology writer. He writes about mobile.
Meta made insane offers in bid to nab OpenAI talent, Altman claims
OpenAI CEO Sam Altman during the Uncapped podcast in June 2025.

OpenAI chief Sam Altman has said that Meta tried to tempt his top AI researchers to switch sides by offering hiring bonuses of $100 million. Yes, you read that right -- $100 million. Altman said that up to now, none of his top team have left for Mark Zuckerberg's Meta.

Altman made the claim on Tuesday in the Uncapped podcast, hosted by his brother, Jack.

Read more
ChatGPT was down: how the June 10 OpenAI outage unfolded
AI assistant ChatGPT and image creator Sora were down as part of a major OpenAI outage
ChatGPT logo on a phone

The popular AI assistant ChatGPT, and image generator Sora, suffered significant downtime as part of a major OpenAI outage today, June 10.

Downdetector showed reports regarding a ChatGPT outage started shortly before 12am PDT overnight and into June 10. This wasn't the first time we've seen ChatGPT go down, with an outage also occurring back in December 2024.

Read more
Apple needs to fix the basics for macOS 26, or let AI run the show
Background apps on M4 MacBook Air.

The Mac apps community is a wonderful place to find utilities that can supercharge your computing experience. Alfred, Raycast, AlDente, and Rectangle are some of the most highly recommended apps for macOS users these days. The open-source community has also produced a few utilities (and their forks) that I use on a daily basis. 

If you read between the lines, you'll notice that these apps fill a functional gap that Apple has yet to offer natively. On the other side of the computing ecosystem, Windows has served those perks for years. Will the next big software upgrade, macOS 26, finally give users an in-house fix? We’ll only get the answer at WWDC 2025 in just over a week from now. 

Read more