All of the internet now belongs to Google’s AI

Google Bard being shown off at Google I/O 2023.

Google’s latest update to its privacy policy gives the company free rein to scrape the web for any content that can help it build and improve its AI tools.

“Google uses information to improve our services and to develop new products, features, and technologies that benefit our users and the public,” the new Google policy says. “For example, we use publicly available information to help train Google’s AI models and build products and features like Google Translate, Bard, and Cloud AI capabilities.”

Gizmodo notes that the policy now says “AI models” where it previously said “for language models.” The policy also now names Bard and Cloud AI, where it previously mentioned only Google Translate as a product for which it collected data.

The privacy policy, which was updated over the weekend, appears especially ominous because it indicates that any information you produce online is up for grabs for Google to use for training its AI models.

The wording appears to cover more than people who use the Google ecosystem in one way or another; it is broad enough that the company could draw on information from any part of the web.

Major issues surrounding the mass development of artificial intelligence include questions about privacy, plagiarism, and whether AI can dispense correct information. Early versions of chatbots such as ChatGPT are based on large language models (LLMs) that used already public sources, such as the Common Crawl web archive, WebText2, Books1, Books2, and Wikipedia, as training data.

Early ChatGPT was infamous for getting stuck when asked about anything after 2021 and then filling in responses with false data. This is likely one of the reasons Google would want unfettered access to web data for tools such as Bard: to give its AI models real-world and potentially real-time training.

Gizmodo also noted that Google could use this new policy to collect old but still human-generated content, such as long-forgotten reviews or blog posts, to get a feel for how human text and speech are developed and distributed. Still, it remains to be seen exactly how Google will use the data it collects.

Several social media platforms, including Twitter and Reddit, which are major sources of up-to-date information, have already limited their public access in the wake of AI chatbot popularity, to the chagrin of their entire communities.

Both platforms have closed free access to their APIs, which prevents users from downloading massive amounts of posts for sharing elsewhere, under the guise of protecting their intellectual property. In practice, the change broke many of the third-party tools that make both Twitter and Reddit run smoothly.

Both Twitter and Reddit have had to deal with other setbacks and controversies as their owners’ concerns heighten about AI taking over.

Fionna Agomuoh
Fionna Agomuoh is a Computing Writer at Digital Trends. She covers a range of topics in the computing space, including…