Skip to main content

Forget text-to-image; this AI makes videos from your prompts

You’ve likely heard about the amazing results realized by text-to-image AI such as Dall-E, Stable Diffusion, and Midjourney. As you might have expected, the revolution is marching onward, with the next target being text-to-video AI tools.

QuickVid generated this video about a DJI Drone and astronauts on Mars.

Google and Meta have teased their text-to-video capabilities in research reports from their AI labs, but this advanced technology hasn’t been available to the public. If you’ve been eagerly awaiting the chance to try creating entire videos with a simple AI prompt, now’s your chance, thanks to QuickVid.

Recommended Videos

Before your expectations climb too high, it’s important to realize that this isn’t equivalent to generating thousands of Stable Diffusion stills and assembling them to create a video or getting access to the most advanced AI systems in the world for true video generation. This is a very early entry into the race for a text-to-video solution.

The first step of the process for the AI is to generate a script based on your prompt. I tested the system by creating a YouTube Short from these words: “A video of a DJI drone flying over an astronaut on Mars, ending with a reaction shot of the surprised astronaut.”

The AI wrote a complete, 79-word narrative from my prompt, then synthesized the speech with a choice of a male or female voice. TechCrunch pointed out that the background video chosen for the generated video is taken from a stock library and there was apparently plenty of footage of “astronauts on Mars.”

As a questionable finishing touch, QuickVid overlays the script as titles and adds thumbnail images generated by the Dall-E API. The resulting YouTube short seen above is … interesting. Perhaps, it would handle more earthly videos better.

In a TechCrunch interview, the developer of QuickVid said improvements are coming, with more personalization options arriving in January. Eventually, QuickVid will also include captions and support avatars.

Next year could see many more text-to-video solutions arrive, along with other visual wonders such as AR glasses and more advanced VR headsets. It should be exciting.

Alan Truly
Former Digital Trends Contributor
Alan Truly is a Writer at Digital Trends, covering computers, laptops, hardware, software, and accessories that stand out as…
More AI may be coming to YouTube in a big way
a content creator recording a thing in the kitchen with a bowl of food

YouTube content creators could soon be able to brainstorm video topic, title, and thumbnail ideas with Gemini AI as part of the "brainstorm with Gemini" experiment Google is currently testing, the company announced via its Creator Insider channel.

The feature is first being released to a small number of selected content creators for critique, as a spokesperson from the company told TechCrunch, before the company decides whether to roll it out to all users. "We're collecting feedback at this stage to make sure we're developing these features thoughtfully and will improve the feature based on feedback," the video's host said.

Read more
Intel’s new AI image generation app is free and runs entirely on your PC
screenshot of AI Playground image creation screen showing more advanced ccontrols

Intel shared a sneak preview of its upcoming AI Playground app at Computex earlier this week, which offers yet another way to try AI image generation. The Windows application provides you with a new way to use generative AI a means to create and edit images, as well as chat with an AI agent, without the need for complex command line prompts, complicated scripts, or even a data connection.

The interesting bit is that everything runs locally on your PC, leveraging the parallel processing power of either an Intel Core Ultra processor with a built-in Intel Arc GPU or through a separate 8GB VRAM Arc Graphics card.

Read more
OpenAI’s latest Sora video shows an elephant made of leaves
An elephant made of leaves, created by OpenAI's Sora technology.

OpenAI left a lot of jaws on the floor last month when it shared the first footage made by Sora, its AI-powered text-to-video generator.

While not perfect, the quality was extraordinary and left many wondering about the kind of transformational impact that such technology will have on the creative industries, including Hollywood.

Read more