Skip to main content

MIT algorithm can predict the (immediate) future from still images

Creating Videos of the Future
Humans still can’t predict elections but we’re pretty good at predicting the immediate future. Baby drops glass cup, cup falls and shatters, and baby starts to cry. We’re so good at these short-term forecasts that we can often even describe what events will happen next in an image.

But what’s second nature for us can prove complicated for computers. Will the glass break or bounce? Will the baby laugh or cry?

Recommended Videos

A team of researchers from the Massachusetts Institute of Technology (MIT) Computer Science and Artificial Intelligence Laboratory (CSAIL) have developed a system that can predict the following events in images and generate videos to depict them. The system needs work — its current productions are simple, short, and unassuming — but it stands out for its unique approach and accuracy.

“Instead of building up scenes frame by frame, we focus on processing the entire scene at once,” Carl Vondrick, PhD at MIT CSAIL and lead author of the paper, told Digital Trends.
video-examples-with-input-and-output

Alternative computer vision models that attempt the same task use recurrent networks to generate predictive videos on a frame-by-frame basis. The system developed by Vondrick and his team uses “convolutional networks” to generate all 32 frames simultaneously.

“The existing approach of going frame by frame has a certain logic,” Vondrick said, “but it also creates a massive margin for error. It’s sort of like a big game of ‘Telephone,” which means that the message most likely will fall apart by the time you go around the whole room.

“In contrast, our approach is the ‘Telephone’ equivalent of speaking to everyone in the room at once,” he added.

The researchers trained the system on a year of footage packed into two million videos and — in order to generate all frames at once — taught it distinguish foregrounds from backgrounds, and mobile objects from stationary ones. They then showed the system still images and had it generate short clips of subsequent events.

Once the system could generate video clips, Vondrick and his team set out to refine it through a method called adversarial learning.

“The idea behind adversarial learning is to have two neural networks compete against each other,” Vondrick said. “One network tries to decide what is real versus fake, and another tries to generate something that fools the first network.”

Through this computer competition the generative algorithm improved the accuracy of its video clips until it was able to fool human subjects 20 percent more often than a baseline model, according to a paper that will be presented next week at the Neural Information Processing Systems conference in Barcelona.

But with accuracy comes complexity and with complexity comes obstacles.

The current system’s videos are short — a mere one and a half seconds long. If the clips were much longer than that, they’d risk their consistency. “The key challenge is being able to reliably track the relationships between all of the objects in a scene … to make sure that the video that’s being generated still makes sense five or ten seconds later,” Vondrick said. To develop accurate and long videos, the system may need human input to help it grasp context and connection between seemingly unrelated actions, such as jogging and showering.

Vondrick’s ambitious end goal is to develop an algorithm that can create believable feature-length films, though he admits that is still some years off. In the near term though he thinks this system could refine AI systems by helping them adapt to unpredictable environments.

Dyllan Furness
Former Digital Trends Contributor
Dyllan Furness is a freelance writer from Florida. He covers strange science and emerging tech for Digital Trends, focusing…
The best cheap laptop just got cheaper during Prime Big Deal Days
The Best Cheap Laptop Just Got Cheaper During Prime Big Deal Days

Acer’s Aspire 15” Laptop is a versatile workhorse beloved by students, professionals, remote workers, and casual gamers alike. And for the next couple of days, you’ll be hard-pressed to find a better laptop for $500, thanks to Prime Big Deal Days on October 8 and 9.

The Aspire 15” is now 23% off for Prime Day — and it's more than the sum of its parts (although the parts are pretty dang good, too). Features like a solid-state drive, a full HD display, and a powerful AMD Ryzen 5 5500U processor are typically reserved for machines that cost hundreds more.

Read more
Prime Big Deal Days Dell XPS Deals 2024: laptops and desktops
Dell XPS 13 9345 front angled view showing display and keyboard.

Update 10/08/24: Dell XPS deals are really rolling out for Prime Day, and so we've scoured the big retailers to find the best deals and updated them below for you, as well as adjusted prices where needed.

With the Amazon Prime Big Deal Days event underway on October 8 and 9, there are some fantastic Prime Day deals happening, and we’re not just talking about at Amazon either. That’s because many other retailers get involved, such as Dell. Let's focus on Dell XPS deals. These are the best ways to score a great desktop PC or laptop for far less than usual. These Prime Day laptop deals are sure to be something special. We’re also counting on some sweet Prime Day gaming PC deals if you don’t mind the XPS badge rather than Alienware. Read on and we’ll take you through the very best Dell XPS deals and some key buying advice.
Dell XPS 15 -- $1,059 $1,359 22% off

Read more
Get 18% off Alienware’s M18 R2 gaming laptop for Prime Day
Take high processing speeds to go with 18% off Alienware’s M18 R2 gaming laptop for Prime Day
The Alienware m18 R2 gaming laptop with a game on the screen.

Like a pre-hibernation raccoon that got wedged under your deck while foraging for trash, the Alienware M18 R2 gaming laptop makes no apologies for its size. With an 18-inch screen and a heft of nearly 10 pounds, it just isn’t particularly portable. But if that’s no problem for you, then now is the perfect time to gorge yourself on an absolute unit of a gaming laptop by taking  during Prime Big Deal Days on October 8 and 9.

When it hit the market earlier this year, the M18 R2 kept the R1’s sleek metal and plastic shell but improved on the original model with AI-enhanced graphics powered by NVIDIA’s GeForce RTX 40-series GPU. Combined with an Intel Core i7-14650 processor, a 2TB solid-state drive, and 32GB of RAM, the M18 R2 tops out at a maximum turbo frequency of 5.5 GHz (and performs steadily at 2.2 GHz).

Read more