Skip to main content

Meta made DALL-E for video, and it’s both creepy and amazing

Meta unveiled a crazy artificial intelligence model that allows users to turn their typed descriptions into video. The system is called Make-A-Video and is the latest in a trend of AI generated content on the web.

The system accepts short descriptions like “a robot surfing a wave in the ocean” or “clown fish swimming through the coral reef” and dynamically generates a short GIF of the description. There are even three different styles of videos to choose from: surreal, realistic, and stylized.

Related Videos
An artist’s brush painting on a canvas close up

According to a Facebook post by Meta CEO, Mark Zuckerberg, translating written text into video is much harder because of how video requires movement:

“It’s much harder to generate video than photos because beyond correctly generating each pixel, the system also has to predict how they’ll change over time. Make-A-Video solves this by adding a layer of unsupervised learning that enables the system to understand motion in the physical world and apply it to traditional text-to-image generation.”

A young couple walking in a heavy rain

Meta’s AI Research team wrote a paper describing how the system works and how it differs from current text-to-image (T2I) methods. Unlike other machine language models, Meta’s Text-to-Video (T2V) method doesn’t use pre-defined text-video pairs. For example, it doesn’t pair “man walking” with a video of an actual man walking.

If this sounds a lot like DALL-E, the popular T2I application, you wouldn’t be far off. Other T2I applications have rolled out since DALL-E gained popularity. TikTok released a filter in August called AI Greenscreen that generates painting style images based on the words you type.

A fluffy baby sloth with an orange knitted hat trying to figure out a laptop close up highly detailed studio lighting screen reflecting in its eye

AI-generated content has become quite buzzworthy within the last few years. Deepfake technology, machine learning techniques to replace a person’s face with another, is even used by visual effects studios for big budget shows like The Mandalorian.

In July, The Times mistakenly reported on a Ukrainian woman in the midst of the Russia-Ukraine war. The problem is she wasn’t real.

The threat of AI probably isn’t a real threat, but projects like DALL-E and Make-A-Video are fun explorations into some of the interesting possibilities.

Editors' Recommendations

Midjourney v5 language model update adds realism to human hands
Midjourney Join the Beta.

Midjourney v5 is the latest language model of the popular text-to-image generator known for its realistic creations.

The update rolled out to Midjourney's paid customer base on Wednesday and many users, including Graphic designer Julie Wieland, have been sharing their new AI-generated artwork. AI details that the v5 language model brings with it include improved "efficiency, coherency, and quality," Midjourney said on its website.

Read more
Here’s the ChatGPT word limit and how to get around it
A laptop opened to the ChatGPT website.

Fans of ChatGPT adore the AI chatbot for several purposes, including its ability to generate detailed essays in a matter of seconds. However, one of its little-known limitations is that there is a word and character limit set on how much content it can output per query.
Reddit members and other AI enthusiasts have been discussing this for months, and luckily there are easy workarounds for this limitation by way of the prompts you can use.

What is the ChatGPT word limit?
ChatGPT's parent company, OpenAI. set the word and character limit as part of its ongoing development of the AI chatbot, which is still in its research preview phase. Some of the issues with ChatGPT include its affinity toward "social biases, hallucinations, and adversarial prompts," in addition to producing inaccurate content when the AI algorithm is overwhelmed or at a loss for information to process.
Similarly, ChatGPT might simply stop producing content when the request is too complex for the AI to handle. This happens at about 500 words or 4,000 characters. If you happen to give the chatbot a request for a specific number of words above 500, you might find that it cuts off midsentence somewhere after 500 words.

Read more
GPT-4: how to use, new features, availability, and more
A laptop opened to the ChatGPT website.

ChatGPT-4 has officially been announced, confirming the longtime rumors around its improvements to the already incredibly impressive language skills of OpenAI's ChatGPT.

OpenAI calls it the company's "most advanced system, producing safer and more useful responses." Here's everything we know about it so far.
Availability

Read more