Skip to main content

Newly developed AI system can accurately judge a book by its cover

many old books in a book shop or library
123RF/Yulia Grogoryeva
The tech world sure loves to disrupt conventional wisdom. Its latest victim? The old adage that you should never judge a book by its cover.

With disproving that sentiment in mind, researchers at Japan’s Kyushu University have trained a neural network to be able to predict which genre a book falls into simply by studying its cover.

“The purpose of this work is to determine if machines can learn the meaning behind book covers without textual clues,” researcher and paper co-author Brian Kenji Iwana told Digital Trends. “For this study, we took book cover images and classified them by genre using an artificial neural network. We also look at some of the hidden design rules of the covers found by the network.”

judging-a-book-graph
Kyushu University

For their dataset, Iwana and colleague Seiichi Uchida used a total of 137,788 book covers for titles available for sale on Amazon. These fell into 20 different categories, and was simplified slightly by only using the primary category a book was listed under, in instances where it fell under multiple genre headings.

Eighty percent of this data was then used to train the four-layer neural network the pair used, thereby leaving 20 percent for validating and testing it.

More than 40 percent of the time, the algorithm was able to place the correct genre within its three best guesses, while it predicted the right genre first guess upward of 20 percent of the time.

Unfortunately, the pair didn’t research how well humans do at the classification task (which is relatively straightforward for a genre like cookery books, but tougher when it comes to broader genres like biographies or memoirs). However, the results of the algorithm show significantly better results than just a random guess.

“The idea came from our previous work with font and document recognition,” Iwana said. “We are particularly interested in pushing the field of machine learning into tasks that traditionally require human feelings, such as impression and design.”

There are multiple possible applications for this research. It could, for instance, be used to help classify digitized books in cases where labelled data is lacking. It could also (creative-minded designers beware!) be used to help find “rules” that more easily visually describe what a book is about — helpful for both machines and bookstore-browsing humans alike.

Longer term, it even opens up the possibility of algorithms being able to generate cover concepts by themselves.

“Our work shows that it’s possible to use machines to learn the relationship between book covers and genre,” Iwana concluded. “This can lead to tools used to help authors design book covers or to automate genre prediction. It’s one step closer to bringing machine learning into the field of design.”

Editors' Recommendations

Luke Dormehl
I'm a UK-based tech writer covering Cool Tech at Digital Trends. I've also written for Fast Company, Wired, the Guardian…
This AI can spoof your voice after just three seconds
man speaking into phone

Artificial intelligence (AI) is having a moment right now, and the wind continues to blow in its sails with the news that Microsoft is working on an AI that can imitate anyone’s voice after being fed a short three-second sample.

The new tool, dubbed VALL-E, has been trained on roughly 60,000 hours of voice data in the English language, which Microsoft says is “hundreds of times larger than existing systems”. Using that knowledge, its creators claim it only needs a small smattering of vocal input to understand how to replicate a user’s voice.

Read more
Meta made DALL-E for video, and it’s both creepy and amazing
A video created via AI, featuring a creature typing in a hat.

Meta unveiled a crazy artificial intelligence model that allows users to turn their typed descriptions into video. The system is called Make-A-Video and is the latest in a trend of AI generated content on the web.

The system accepts short descriptions like "a robot surfing a wave in the ocean” or "clown fish swimming through the coral reef" and dynamically generates a short GIF of the description. There are even three different styles of videos to choose from: surreal, realistic, and stylized.

Read more
I pitched my ridiculous startup idea to a robot VC
pitched startup to robot vc waterdrone

Aqua Drone. HighTides. Oh Water Drone Company. H2 Air. Drone Like A Fish. Whatever I called it, it was going to be big. Huge. Well, probably.

It was the pitch for my new startup, a company that promised to deliver one of the world’s most popular resources in the most high-tech way imaginable: an on-demand drone delivery service for bottled water. In my mind I was already picking out my Gulfstream private jet, bumping fists with Apple’s Tim Cook, and staging hostile takeovers of Twitter. I just needed to convince a panel of venture capitalists that I (and they) were onto a good thing.

Read more