Skip to main content

Nvidia turns simple text prompts into game-ready 3D models

A colorful collage of images generated by Nvidia's LATTE3D.
Nvidia

Nvidia just unveiled its new generative AI model, dubbed Latte3D, during GTC 2024. Latte3D appears to be ChatGPT on extreme steroids. I’s a text-to-3D model that accepts simple, short text prompts and turns them into 3D objects and animals within a second. Much faster than its older counterparts, Latte3D works like a virtual 3D printe that could come in handy for creators across many industries.

Latte3D was made to simplify the creation of 3D models for many types of creators, such as those working on video games, design projects, marketing, or even machine learning and training for robotics. In Nvidia’s demo of the model, it appears super simple to use. Following a quick text prompt, the AI generates a 3D model and shortly after finishes it off with much more detail. While the end result is nowhere near as lifelike as OpenAI’s Sora, it’s not meant to be — this is a way to speed up creating assets instead of having to build them from the ground up.

The model generates several different options for the user to choose from, and Nvidia says that these shapes can be “optimized for higher quality within a few minutes.” The designs can then be exported to different platforms, such as Nvidia’s Omniverse, and can be tweaked to match the desired end result. Nvidia trained Latte3D by using its Ada A100 Tensor Core GPUs and supported the training with ChatGPT prompts to ready it for interacting with real users.

Get your weekly teardown of the tech behind PC gaming
Check your inbox!

As of right now, Latte3D can only generate objects and animals. To that end, it appears to do a solid job of discerning different animals, textures, and object types. Nvidia showed off these capabilities by presenting objects such as an amigurumi (crochet) common crane or an origami sphynx cat. The model was taught to recognize various species and thus can tell the difference between an Italian greyhound and a Shiba Inu.

LATTE3D Text to 3D Generative AI Model from NVIDIA Research

Creators who want to use Latte3D to do more can train it on a different dataset, be it plants or household objects, and later use it for their own purposes. Nvidia brings up some interesting use cases here, such as training personal assistant robots before deploying them. It’s easy to imagine that Latte3D will come in handy for game devs, but the potential goes far beyond just gaming scenarios.

Sanja Fidler, vice president of AI research at Nvidia, remarked on how much faster Latte3D is compared to its predecessors: “A year ago, it took an hour for AI models to generate 3D visuals of this quality — and the current state of the art is now around 10 to 12 seconds. We can now produce results an order of magnitude faster,” said Fidler.

The recent announcements related to using AI in game development are all pretty groundbreaking, and Nvidia’s Latte3D joins a growing list of tools that may one day completely change the process of creating a game. For instance, Nvidia just recently unveiled non-player characters (NPCs) with dialogue entirely generated by AI. Meanwhile, Unreal Engine’s latest update can generate film-quality visuals in games in real time, all with the help of machine learning.

Monica J. White
Monica is a computing writer at Digital Trends, focusing on PC hardware. Since joining the team in 2021, Monica has written…
Few people are using ChatGPT and other AI tools regularly, study suggests
ChatGPT app running on an iPhone.

Not a day seems to go by without generative-AI products like ChatGPT making the news, but few people are actually making regular use of the tools, a new study suggests.

The study was carried out by the Reuters Institute and Oxford University, and it involved 6,000 respondents from the U.S., U.K., France, Denmark, Japan, and Argentina. Researchers found that OpenAI's ChatGPT is by far the most widely used generative-AI tool and is two or three times more widespread than the next most widely used products -- Google Gemini and Microsoft Copilot.

Read more
Why Llama 3 is changing everything in the world of AI
Meta AI on mobile and desktop web interface.

In the world of AI, you've no doubt heard about what OpenAI and Google have been up to. And now, Meta's Llama LLM (large language model) is becoming an increasingly important player in the game, especially with its open-source nature. Meta recently made a big splash with the launch of its Llama 3 AI model, and it's shaken up the field dramatically.

The reasons why are multiple and varied. It's free to use, it has a wide user base, and yes, it's open source, to name but a few. Here's why Llama 3 is taking the AI industry by storm and may shape its future for some time to come.
Llama 3 is really good
We can debate until the cows come home about how useful AIs like ChatGPT and Llama 3 are in the real world -- they're not bad at teaching you board game rules -- but the few benchmarks we have for how capable these AI are give Llama 3 a distinct advantage.

Read more
GPT-4 vs. GPT-3.5: how much difference is there?
Infinix Zero 30 5G Android phone in gold color with ChatGPT virtual assistant.

The ChatGPT chatbot is an innovative AI tool developed by OpenAI. As it stands, there are two main versions of the software: GPT-4 and GPT-3.5. Toe to toe in more ways than one, there are a couple of key differences between both versions that may be deal-breakers for certain users. But what exactly are these differences? We’re here to help you find out. 

We’ve put together this side-by-side comparison of both ChatGPT versions, so when you’re done reading, you’ll know what version makes the most sense for you and yours.
What are GPT 3.5 and GPT-4?

Read more