Skip to main content

OpenAI’s new Shap-E tool is Dall-E for 3D objects

OpenAI‘s latest endeavor, Shap-E, is a model that allows you to generate 3D objects from text, not unlike how Dall-E can create 2D images.

According to OpenAI, Shap-E is “a conditional generative model for 3D assets. Unlike recent work on 3D generative models which produce a single output representation, Shap-E directly generates the parameters of implicit functions that can be rendered as both textured meshes and neural radiance fields.”

Image used with permission by copyright holder

The company’s GitHub posting goes on to explain how Shap-E is trained on a combination of mapping 3D assets and a conditional diffusion model.

However, this free-to-run program is a little more challenging to install and set up than the company’s ever-popular ChatGPT, as was explored by Tom’s Hardware.

You can download the Shap-E model on GitHub at no charge and access it on Microsoft Paint 3D. It also works when converted into an STL file, which allows the renders you create to be brought to life via 3D printers.

While this basic knowledge of the Shap-E model might seem simple enough, some tech savviness might be required to get the model installed and running.

The publication’s editor-in-chief, Avram Piltch, tested out Shap-E, which he claims took him eight hours to wrap his mind around. He added that OpenAI offers little by way of instructions outside of explaining that you should use a Python pip command for installation.

Once installed, Piltch says he was able to test prompts with color-animated GIF files and monochrome PLY files, with the animated GIFs being favorable, he noted.

Some prompts included a shark, a Minecraft creeper, and “an airplane that looks like a banana,” all of which had varying levels of quality depending on their file type. Piltch also used the model’s function, which lets users upload a 2D image for conversion into a 3D object.

The editor noted that those attempting to install Shap-E and render 3D objects should keep in mind that the model requires a lot of system resources from a PC.

In particular, Shap-E is compatible only with Nvidia GPUs and requires high-performance CPUs to render in a matter of minutes as opposed to hours.

Fionna Agomuoh
Fionna Agomuoh is a technology journalist with over a decade of experience writing about various consumer electronics topics…
ChatGPT can now generate images for free using Dall-E
ChatGPT results on an iPhone.

Since its launch last September, OpenAI's Dall-E 3 image generator has only been available to its Plus, Teams, and Enterprise subscribers. Now, nearly a year later, Dall-E is accessible to the rest of us — just with some stringent restrictions.

https://twitter.com/OpenAI/status/1821644904843636871

Read more
The ChatGPT app for Mac just got this helpful new feature
The OpenAI desktop app showing the text input window

OpenAI's recently released Mac desktop app is getting a bit easier to use. The company has announced that the program will now offer side-by-side access to the ChatGPT text prompt when you press Option + Space.

The desktop version offers nearly identical functionality to the web-based iteration. Users can chat directly with the AI, query the system using natural language prompts in either text or voice, search through previous conversations, and upload documents and images for analysis. You can even take screenshots of either the entire screen or just a single window, for upload.

Read more
ChatGPT Advanced Voice mode: release date, compatibility, and more
Nothing Phone 2a and ChatGPT voice mode.

Advanced Voice Mode is a new feature for ChatGPT that enables users to hold real-time, humanlike conversations with the AI chatbot without the need for a text-based prompt window or back-and-forth audio. It was released in late July to select Plus subscribers after being first demoed at OpenAI's Spring Update event.

According to the company, the feature “offers more natural, real-time conversations, allows you to interrupt at any time, and senses and responds to your emotions.” It can even take breath breaks and simulate human laughter during conversation. The best part is that access is coming soon, if you don't have it already.
When will I get Advanced Mode?
Introducing GPT-4o

Read more