Newly developed AI system can accurately judge a book by its cover

The tech world sure loves to disrupt conventional wisdom. Its latest victim? The old adage that you should never judge a book by its cover.

Aiming to disprove that sentiment, researchers at Japan’s Kyushu University have trained a neural network to predict which genre a book falls into simply by studying its cover.

“The purpose of this work is to determine if machines can learn the meaning behind book covers without textual clues,” researcher and paper co-author Brian Kenji Iwana told Digital Trends. “For this study, we took book cover images and classified them by genre using an artificial neural network. We also look at some of the hidden design rules of the covers found by the network.”

For their dataset, Iwana and colleague Seiichi Uchida used a total of 137,788 book covers for titles available for sale on Amazon. These fell into 20 different categories. Where a book was listed under multiple genre headings, the task was simplified slightly by using only its primary category.
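To make the labelling step concrete, here is a minimal Python sketch of reducing a multi-genre listing to a single label. It assumes the primary category is simply the first one listed; the article does not say how the researchers identified it. The book records, the genre names, and the helper `primary_genre` are all hypothetical.

```python
from collections import Counter


def primary_genre(categories):
    """Return the first listed category, treated here as the primary one (an assumption)."""
    if not categories:
        raise ValueError("book has no categories")
    return categories[0]


# Hypothetical records; real listings would come from the bookseller's catalogue.
books = [
    {"title": "Example A", "categories": ["Cookbooks", "Health"]},
    {"title": "Example B", "categories": ["Travel"]},
    {"title": "Example C", "categories": ["Cookbooks"]},
]

# One label per book, even when the book sits under several headings.
labels = [primary_genre(book["categories"]) for book in books]
genre_counts = Counter(labels)
print(labels)
print(genre_counts)
```

Keeping one label per cover turns the problem into ordinary single-label classification, at the cost of discarding the secondary genres.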

The pair then used 80 percent of this data to train their four-layer neural network, leaving 20 percent for validating and testing it.
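An 80/20 split of this kind can be sketched in a few lines of Python. This is only an illustration: the shuffling, the fixed seed, and the function name `split_dataset` are assumptions, and the article does not describe how the held-out 20 percent was divided between validation and testing.

```python
import random


def split_dataset(items, train_fraction=0.8, seed=0):
    """Shuffle a copy of the items and cut it into training and held-out parts."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)  # seeded so the split is reproducible
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# Stand-in integer IDs for the 137,788 covers mentioned in the article.
cover_ids = list(range(137_788))
train_ids, held_out_ids = split_dataset(cover_ids)
print(len(train_ids), len(held_out_ids))  # 110230 27558
```

Shuffling before cutting matters when the source list is ordered, for example by genre, since an unshuffled cut could leave whole genres out of the training set.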

More than 40 percent of the time, the algorithm placed the correct genre within its three best guesses, and it predicted the right genre on its first guess upward of 20 percent of the time.
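These two figures are what is usually called top-3 and top-1 accuracy. The sketch below shows how such numbers can be computed from a classifier's per-genre scores. The scores and labels are invented toy values over four genres, not the researchers' results, and `top_k_accuracy` is a hypothetical helper.

```python
def top_k_accuracy(scores, true_labels, k):
    """Fraction of examples whose true label is among the k highest-scoring classes."""
    hits = 0
    for row, truth in zip(scores, true_labels):
        # Class indices ordered from highest score to lowest.
        ranked = sorted(range(len(row)), key=lambda i: row[i], reverse=True)
        if truth in ranked[:k]:
            hits += 1
    return hits / len(true_labels)


# Toy scores for four covers over four genres (illustrative only).
scores = [
    [0.60, 0.30, 0.10, 0.00],
    [0.40, 0.35, 0.20, 0.05],
    [0.10, 0.20, 0.30, 0.40],
    [0.25, 0.50, 0.15, 0.10],
]
true_labels = [0, 2, 0, 1]

top1 = top_k_accuracy(scores, true_labels, k=1)
top3 = top_k_accuracy(scores, true_labels, k=3)
print(top1, top3)  # 0.5 0.75
```

Top-3 accuracy is always at least as high as top-1, because any correct first guess also counts as a hit within the three best guesses.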

Unfortunately, the pair didn’t research how well humans do at the classification task (which is relatively straightforward for a genre like cookery books, but tougher when it comes to broader genres like biographies or memoirs). However, the algorithm performs significantly better than random guessing.
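For a sense of scale, the chance baseline can be worked out directly. The sketch below assumes a guesser that picks k distinct genres uniformly at random from the 20 categories; `chance_top_k` is a hypothetical helper, and the article itself does not quote baseline figures.

```python
from fractions import Fraction


def chance_top_k(num_classes, k):
    """Probability that k distinct uniformly random guesses include the true class."""
    return Fraction(min(k, num_classes), num_classes)


chance_top1 = chance_top_k(20, 1)  # one guess out of 20 genres
chance_top3 = chance_top_k(20, 3)  # three distinct guesses out of 20 genres
print(float(chance_top1), float(chance_top3))  # 0.05 0.15
```

Under that assumption, random guessing would be right about 5 percent of the time on the first guess and 15 percent of the time within three guesses, against the reported figures of more than 20 and more than 40 percent.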

“The idea came from our previous work with font and document recognition,” Iwana said. “We are particularly interested in pushing the field of machine learning into tasks that traditionally require human feelings, such as impression and design.”

There are multiple possible applications for this research. It could, for instance, be used to help classify digitized books in cases where labelled data is lacking. It could also (creative-minded designers beware!) be used to help find “rules” that make it easier to convey visually what a book is about, which would help machines and bookstore-browsing humans alike.

Longer term, it even opens up the possibility of algorithms being able to generate cover concepts by themselves.

“Our work shows that it’s possible to use machines to learn the relationship between book covers and genre,” Iwana concluded. “This can lead to tools used to help authors design book covers or to automate genre prediction. It’s one step closer to bringing machine learning into the field of design.”

Luke Dormehl
I'm a UK-based tech writer covering Cool Tech at Digital Trends. I've also written for Fast Company, Wired, the Guardian…