Skip to main content

AI is now being trained by AI to become a better AI

An OpenAI graphic for ChatGPT-4.
OpenAI

OpenAI has developed an AI assistant, dubbed CriticGPT, to help its crowd-sourced trainers further refine the GPT-4 model. It spots subtle coding errors that humans might otherwise miss.

After a large language model like GPT-4 is initially trained, it subsequently undergoes a continual process of refinement, known as Reinforcement Learning from Human Feedback (RLHF). Human trainers interact with the system and annotate the responses to various questions, as well as rate various responses against one another, so that the system learns to return the preferred response and increases the model’s response accuracy.

The problem is that as the system’s performance improves, it can outpace the level of expertise of its trainer, and the process of identifying mistakes and errors becomes increasingly difficult.

These AI trainers aren’t always subject matter experts, mind you. Last year, OpenAI got caught crowd sourcing the effort to Kenyan workers — and paying them less than $2 an hour — to improve its models’ performance.

a criticGPT screenshot
OpenAI

This issue is especially difficult when refining the system’s code generation capabilities, which is where CriticGPT comes in.

“We’ve trained a model, based on GPT-4, called CriticGPT, to catch errors in ChatGPT’s code output,” the company explained in a blog post Thursday. “We found that when people get help from CriticGPT to review ChatGPT code they outperform those without help 60 percent of the time.”

What’s more, the company released a whitepaper on the subject, titled “LLM Critics Help Catch LLM Bugs,” which found that “LLMs catch substantially more inserted bugs than qualified humans paid for code review, and further that model critiques are preferred over human critiques more than 80 percent of the time.”

Interestingly, the study also found that when humans collaborated with CriticGPT, the AI’s rate of hallucinating responses was lower than when CriticGPT did the work alone, but that rate of hallucination was still higher than if a human just did the work by themselves.

Andrew Tarantola
Former Computing Writer
Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…
You can now interact with Google’s AI Mode in search results
Google AI Mode

Google has been working on adding more AI features to its Search feature, and now an integrated AI Mode is being rolled out to the public. Different from the AI Overview function that Google has included as a default part of Search since last year, the AI Mode is a chatbot which users can interact with as part of their search results.

AI Mode has previously only been available as a Google Labs experiment, but now Google says that it will be coming to search for "a small percentage of people" in the U.S. over the coming weeks. Those who are part of the test will see an "AI Mode" tab in Search, and clicking on it will bring up information related to your search from the chatbot.

Read more
Meta’s new AI app lets you share your favorite prompts with friends
Meta AI WhatsApp widget.

Meta has been playing the AI game for a while now, but unlike ChatGPT, its models are usually integrated into existing platforms rather than standalone apps. That trend ends today -- the company has launched the Meta AI app and it appears to do everything ChatGPT does and more.

Powered by the latest Llama 4 model, the app is designed to "get to know you" using the conversations you have and information from your public Meta profiles. It's designed to work primarily with voice, and Meta says it has improved responses to feel more personal and conversational. There's experimental voice tech included too, which you can toggle on and off to test -- the difference is that apparently, full-duplex speech technology generates audio directly, rather than reading written responses.

Read more
Ray-Ban Meta Glasses are my favorite AI gadget, and they keep getting better
Ray-Ban Meta Glasses worn by Prakhar Khanna.

Meta announced its Ray-Ban AI Glasses in October 2023, and while the company hasn’t launched a successor yet, it has steadily expanded the feature set, turning them into my favorite AI gadget. These are all quality-of-life upgrades that would ideally be released with the next-gen product. But Meta has announced the expansion of Ray-Ban Meta Glasses to more regions and new Meta AI features rolling out starting this week.

I bought a pair of Headliner Meta Ray-Bans in January 2024, and they’ve been my travel companion ever since. It's not because I can record videos while on the go, but because they are the first AI device that doesn’t scream AI. The ambient presence of tech is what makes them special, and they’re only improving, even after 18 months since launch.

Read more