Get ready: AI-generated GIFs might be coming soon

With chatbots and text-to-image generators taking the internet by storm, the next frontier of AI might be text-to-video generators.

Nvidia recently published a research paper from its Toronto AI Lab called “High-Resolution Video Synthesis with Latent Diffusion Models.” The paper details how the company uses Stable Diffusion to build a tool that generates moving images from text prompts.

The tech company showcased demos of the Latent Diffusion Models (LDMs), which use text to generate video clips without large amounts of computer processing, TechRadar noted.

The tool can generate GIF-style clips roughly 4.7 seconds long at a resolution of 1,280 x 2,048. It can also create longer videos at a lower resolution of 512 x 1,024, according to the research paper.

Having viewed a demo of the technology, TechRadar said the tool is likely best suited to being a text-to-GIF generator at this point. The publication noted it could easily handle simple prompts such as “a stormtrooper vacuuming on the beach” or “teddy bear is playing the electric guitar, high definition, 4K.” Even so, the resulting GIFs still showed random artifacts and smudging, as is common with other widely used AI tools such as Midjourney.

The publication believes longer videos still need a little more development before they hit prime time, but expects Nvidia to work quickly to get the technology ready. The generated clips might work well for stock libraries and similar purposes.

Other companies are also experimenting with AI text-to-video generators. Google demoed its Phenaki generator, which allows longer prompts that produce 20-second clips. Runway, a startup, announced its second-generation video model last month, which is also based on Stable Diffusion. Its demo of the prompt “the late afternoon sun peeking through the window of a New York City loft” shows how you can add slight moving effects to still images.

Users also stand to benefit from the addition of AI in other programs, such as Adobe Firefly and Adobe Premiere Rush, according to TechRadar.

Some other companies, such as Narakeet and Lumen5, market themselves as having text-to-video generators. However, many of these tools work more like PowerPoint presentations, assembling text, audio, images, and perhaps some pre-produced video clips in response to prompts, as opposed to generating a unique work.

Fionna Agomuoh
Fionna Agomuoh is a Computing Writer at Digital Trends. She covers a range of topics in the computing space, including…