Skip to main content

DALL-E 3 could take AI image generation to the next level

DALL-E 2DALL-E 2 Image on OpenAI.
OpenAI

OpenAI might be preparing the next version of its DALL-E AI text-to-image generator with a series of alpha tests that have now been leaked to the public, according to the Decoder.

An anonymous leaker on Discord shared details about his experience, having access to the upcoming OpenAI image model being referred to as DALL-E 3. He first appeared in May, telling the interest-based Discord channel that he was part of an alpha test for OpenAI, trying out a new AI image model. He shared the images he generated at the time.

We've NEVER seen Image Generation This Good! | SNEAK PEAK

The May alpha test version had the ability to generate images of multiple aspect ratios inside the image model. YouTuber, MattVidPro AI then showcased several of the images that were generated in a 16:9 aspect ratio. This version also showed the model’s prowess for high-quality text production, which continues to be a pain point for rival models, even for top generators such as Stable Diffusion and Midjourney.

Some examples showcased images, such as text melded into a brick wall, a neon sign of words, a billboard sign in a city, a cake decoration, and a name etched into a mountain. The model maintains that DALL-E is good at generating people. One such image displayed a woman eating spaghetti at a party from a fisheye point of view.

The leaker returned to the Discord channel in mid-July with more details and new images. He claimed to be a part of a “closed alpha” test version that included approximately 400 subjects. He added that he was invited to the trial via email and was also included in the testing of the original DALL-E and DALL-E 2. This is what led to the conclusion that the alpha test might be for DALL-E 3, though it has not been confirmed.

The model has been updated considerably between May and July. The leaker has showcased this by sharing images generated based on the same prompt, showing how powerful DALL-E 3 has gotten over time. The prompt reads a painting of a pink jester giving a high five to a panda while in a cycling competition. The bikes are made of cheese and the ground is very muddy. They are driving in a foggy forest. The panda is angry.

The May alpha produces the general scene that hits most of the points of the prompt. There’s a little distortion in the hands connecting, and the wheels of the bikes are yellow as opposed to being made of cheese. However, the July alpha is far more detailed, with the pink jester and the panda clearly high-fiving and the bicycle wheels made of cheese in several generations.

Meanwhile, in Midjourney, the jester is missing from the scene, the pandas are on motorcycles instead of bicycles. There are roads, instead of mud. The pandas are happy instead of angry.

There are a host of DALL-E 3 July alpha image examples that show the potential of the model. However, with the alpha test being uncensored, the leaker noted that also has the potential to generate scenes of “violence and nudity or copyrighted material such as company logos.”

Some examples include a gory anime girl, a Game of Thrones character, a Grand Theft Auto V cover, a zombie Jesus eating a Subway sandwich, also suggesting mild gore, and Shrek being dug up from an archeological dig, among others.

MattVidPro AI noted that the image model generates images as if they’re supposed to be in a specific style.

DALL-E 2 launched in April 2022 but was heavily regulated with a waitlist due to its popularity and concerns about ethics and safety. The AI image generator became accessible to the public in September 2022.

Editors' Recommendations

Fionna Agomuoh
Fionna Agomuoh is a technology journalist with over a decade of experience writing about various consumer electronics topics…
Stable Diffusion aims to fix its problem with generating fingers
Stable Diffusion AI image generator.

Future iterations of AI-generated art are set to be more realistic thanks to an upcoming version of Stable Diffusion that specifically tackles the problem of depicting fingers and hands.

According to a recent Bloomberg report, the company Stability AI, which develops the Stable Diffusion AI image generator, has plans to release a new SDXL 0.9 model that will propel the abilities of Stable Diffusion.

Read more
Meta’s new AI app is both for patients with vocal cord damage and in-game NPCs
Audio-Technica AT-SB727 Sound Burger rear panel and carry strap.

Meta (formerly Facebook) is introducing its first artificial intelligence offering since the AI generator industry exploded in late 2022.

The brand's text-to-audio generator, called Voicebox is expected to be the voice equivalent of ChatGPT, which processes text prompts into detailed written results, and Dall-E which develops realistic artwork. Voicebox in turn will be able to take text prompts and produce audio clips, according to Engadget.

Read more
This new Photoshop tool could bring AI magic to your images
A mountainous landscape at night with the Northern Lights in the sky, a lake in the foreground, and a person standing under a rock archway on the right. This image was made with Adobe Photoshop's Generative Fill tool.

These days, it seems like everyone and their dog is working artificial intelligence (AI) into their tech products, from ChatGPT in your web browser to click-and-drag image editing. The latest example is Adobe Photoshop, but this isn’t just another cookie-cutter quick fix -- no, it could have a profound effect on imagery and image creators.

Photoshop’s newest feature is called Generative Fill, and it lets you use text prompts to automatically adjust areas of an image you are working on. This might let you add new features, adjust existing elements, or remove unwanted sections of the picture by typing your request into the app.

Read more