Skip to main content
  1. Home
  2. Emerging Tech
  3. Computing
  4. News

Neural network can create high-res images based on a text description

Add as a preferred source on Google

As far as artificial intelligence goes, 2016 has been the year of deep learning. Brain-inspired neural networks have received massive amounts of investment in time, resources and funding — and, boy, has it ever paid off!

In a new piece of research — carried out by investigators at Rutgers University, the University of North Carolina at Charlotte, Lehigh University, and the Chinese University of Hong Kong — neural networks have been used to generate high quality images based on nothing more detailed than basic text descriptions.

Recommended Videos

“Generating realistic images from text descriptions has many applications,” researcher Han Zhang told Digital Trends. “Previous approaches have difficulty in generating high resolution images, and their synthesized images in many cases lack details and vivid object parts. Our StackGAN for the first time generates 256 x 256 images with photo-realistic details.”

A video of the work was shared online by YouTuber Károly Zsolnai-Fehér as part of his excellent series of Two Minute Papers educational videos.

Image used with permission by copyright holder

“For many years, we have trained neural networks to perform tasks like face, traffic sign, or handwriting recognition,” Zsolnai-Fehér told us. “Generally, with millions of training examples, we show the neural network how to do something, and expect them to learn these concepts, and do well on their own afterwards. This piece of work is completely different: here, after learning the neural networks are able to create something completely new — such as synthesizing new, photorealistic images from a piece of text we have written. This opens up a world of possibilities, and I am super-excited to see where researchers take this concept in the future.”

While there have certainly been examples of computational creativity before — ranging from MIT’s Nightmare Machine to projects that can generate predictive video simply by looking at a still image — this is nonetheless an intriguing piece of work. It’s also fascinating because the two-stage method of drawing images looks, to our way of thinking, a whole lot like the way artists will sketch out a piece of work, and then do a second pass to add detail.

We may still be a way from replacing human illustrators with robots, but this is nonetheless an exciting leap forward.

Luke Dormehl
I'm a UK-based tech writer covering Cool Tech at Digital Trends. I've also written for Fast Company, Wired, the Guardian…
You can now generate images with Gemini’s memory without paying a dime
Study guide created by Gemini

Google has made one of Gemini's most interesting AI tricks a lot easier to try. The company is rolling out its personalized image generation feature to eligible U.S. users for free, removing a paywall that previously kept it exclusive to Gemini's paid tiers.

Powered by Google's Nano Banana image model, the feature does more than generate pretty pictures; it taps into Gemini's understanding of you, making AI-generated images feel surprisingly personal.

Read more
Meta’s Brain2Qwerty v2 turns thoughts into text, and it doesn’t need brain implants
The latest AI model decodes brain signals into coherent sentences using external scanners.
Meta Brain2Qwerty v2 Featured

Artificial intelligence is getting surprisingly good at understanding humans. Now, Meta wants it to understand our brains too. The company has unveiled Brain2Qwerty v2, an upgraded AI system that can translate brain activity into full sentences, all without requiring brain implants or surgery. The goal isn't mind reading for the masses. Instead, it's to help people who have lost the ability to speak communicate again.

How a Brain-powered keyboard works

Read more
AI chatbots can often feed into your delusions. Researchers say you should look for three signs
Experts warn that chatbot design choices can reinforce unhealthy beliefs in vulnerable users.
ChatGPT on a smartphone

Artificial intelligence chatbots have become incredibly good at sounding human. But a new review paper by psychiatrist Marc Augustin and fellow researchers Thomas A. Pollak and Helen Morrin, published in NPP—Digital Psychiatry and Neuroscience, argues that existing AI research points to an overlooked psychological risk. The paper, highlighted by The Wall Street Journal, reviews previous studies and proposes a framework explaining how three common chatbot behaviors can combine to reinforce delusional thinking in vulnerable users, creating what the authors call an "amplification spiral."

Researchers say these are the three warning signs

Read more