Your favorite AI chatbot might not be telling the truth

AI search tools are becoming more popular: one in four Americans reports using AI instead of traditional search engines. There is an important caveat, however. These AI chatbots do not always provide accurate information.

A recent study by the Tow Center for Digital Journalism, reported by Columbia Journalism Review, indicates that chatbots struggle to retrieve and cite news content accurately. Even more concerning is their tendency to invent information when they do not have the correct answer.

The eight AI chatbots tested in the study were among the “best”: ChatGPT, Perplexity, Perplexity Pro, DeepSeek, Microsoft’s Copilot, Grok-2, Grok-3, and Google Gemini.

In the tests, the chatbots were given direct excerpts from online articles: 10 articles from each of 20 publishers, or 200 excerpts in all. Each of the eight chatbots received all 200 excerpts as queries, for a total of 1,600 queries. The chatbots were asked to identify the headline of each article, its original publisher, publication date, and URL.

Traditional search engines given similar tests returned the correct information. The AI chatbots did not perform as well.

The findings indicated that chatbots often fail to decline questions they cannot answer accurately, frequently providing incorrect or speculative responses instead. Premium chatbots tend to deliver confidently incorrect answers more often than their free counterparts. Additionally, many chatbots appeared to disregard the Robots Exclusion Protocol (REP), the robots.txt convention websites use to tell web robots, such as search engine crawlers, which pages they may access.
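
For readers unfamiliar with the protocol, a site publishes its crawler preferences in a plain-text robots.txt file, and a well-behaved crawler checks those rules before fetching a page. The short Python sketch below uses the standard library's urllib.robotparser to show that check. The crawler names and the example.com URLs are hypothetical, and the sketch illustrates what compliance looks like in general, not how any of the chatbots in the study actually crawl.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one crawler entirely, and keep every
# other crawler out of the /private/ directory only.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def may_fetch(user_agent: str, url: str) -> bool:
    """Return True if the rules above permit user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


# The blocked crawler may not fetch anything on the site.
print(may_fetch("ExampleAIBot", "https://example.com/news/story.html"))  # False
# Other crawlers may fetch public pages but not the private directory.
print(may_fetch("OtherBot", "https://example.com/news/story.html"))      # True
print(may_fetch("OtherBot", "https://example.com/private/draft.html"))   # False
```

The protocol is voluntary: nothing in robots.txt stops a crawler that chooses to skip this check.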

The study also found that generative search tools were prone to fabricating links and citing syndicated or copied versions of articles. Moreover, content licensing agreements with news sources did not guarantee accurate citations in chatbot responses.

What can you do?

What stands out most about the results of this study is not just that AI chatbots often provide incorrect information but that they do so with alarming confidence. Instead of admitting they don’t know the answer, they give a definite response and rarely qualify it with phrases like “it appears,” “it’s possible,” or “might.”

For instance, ChatGPT incorrectly identified 134 articles yet only signaled uncertainty 15 times out of 200 responses and never refrained from providing an answer.
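
To put those counts in proportion, here is a quick calculation in Python. It uses only the three figures quoted above and assumes, as the sentence implies, that both counts are out of the same 200 responses. The variable and function names are illustrative.

```python
# Figures quoted in the article for ChatGPT.
total_responses = 200
incorrect = 134
signaled_uncertainty = 15


def share(part: int, whole: int) -> float:
    """Return part/whole as a percentage, rounded to one decimal place."""
    return round(100 * part / whole, 1)


error_rate = share(incorrect, total_responses)
hedge_rate = share(signaled_uncertainty, total_responses)

print(f"Incorrect: {error_rate}% of responses")             # Incorrect: 67.0% of responses
print(f"Signaled uncertainty: {hedge_rate}% of responses")  # Signaled uncertainty: 7.5% of responses
```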

Based on the study’s results, it’s probably wise not to rely exclusively on AI chatbots for answers. Instead, a combination of traditional search methods and AI tools is recommended. At the very least, checking an answer against multiple AI chatbots may be beneficial. Otherwise, you risk obtaining incorrect information.

Looking ahead, I wouldn’t be surprised to see a consolidation of AI chatbots as the better ones stand out from the poor-quality ones. Eventually, their results will be as accurate as those from traditional search engines. When that will happen is anyone’s guess.

Bryan M. Wolfe
Bryan M. Wolfe has over a decade of experience as a technology writer. He writes about mobile.