Skip to main content

IBM is cutting deep-learning processing times from days down to hours

deep learning
Daniel Kaesler/123RF
Deep learning uses algorithms inspired by the way human brains operate to put computers to work on tasks too big for organic gray matter. On Monday, IBM announced that a new record for the performance of a large neural network working with a large data set.

The company’s new deep-learning software brings together more than 256 graphics processing units across 64 IBM Power systems. The speed improvements brought about by the research come as a result of better communication between the array of GPUs.

Recommended Videos

Faster GPUs provide the necessary muscle to take on the kind of large scale problems today’s deep-learning systems are capable of tackling. However, the faster the components are, the more difficult it is to ensure that they are all working together as one cohesive unit.

Please enable Javascript to view this content

As individual GPUs work on a particular problem, they share their learning with the other processors that make up the system. Conventional software is not capable of keeping up with the speed of current GPU technology, which means that time is wasted as they wait around for one another’s results.

Hillery Hunter, IBM’s director of systems acceleration and memory, compared the situation to the well-known parable of the blind men and the elephant. The company’s distributed deep-learning project has resulted in an API that developers can be used in conjunction with deep-learning frameworks to scale to multiple servers, making sure that their GPUs remain synchronized.

IBM recorded image recognition accuracy of 33.8 percent on a test run using 7.5 million images from the ImageNet-22K database. The previous best-published result was 29.8 percent, which was posted by Microsoft in October 2014 — in the past, accuracy has typically edged forward at a rate of about one percent in new implementations, so an improvement of four percent is considered to be a very good result.

Crucially, IBM’s system managed to achieve this in seven hours; the process that allowed Microsoft to set the previous record took 10 days to complete.

“Speed and scalability, which means higher accuracy, means that we can quickly retrain an AI model after there is a new cyber-security hack or a new fraud situation,” Hunter told Digital Trends. “Waiting for days or weeks to retrain the model is not practical — so being able to train accurately and within hours makes a big difference.”

These massive improvements in terms of speed, combined with advances in terms of accuracy make IBM’s distributed deep-learning software a major boon for anyone working with this technology. A technical preview of the API is available now as part of the company’s PowerAI enterprise deep-learning software.

Brad Jones
Former Digital Trends Contributor
Brad is an English-born writer currently splitting his time between Edinburgh and Pennsylvania. You can find him on Twitter…
Indiana Jones and the Great Circle proves Nvidia wrong about 8GB GPUs
Indiana jones buried in the sand.

Nvidia was wrong, and Indiana Jones and the Great Circle is proof of that. Despite being a game that's sponsored by Nvidia due to its use of full ray tracing -- which is said to arrive on December 9 -- multiple of Nvidia's best graphics cards struggle to maintain a playable frame rate in the game, and that largely comes down to VRAM.

Computer Base tested a swath of GPUs in the game across resolutions with the highest graphics preset, and one consistent trend emerged. Any GPUs packing less than 12GB of VRAM couldn't even maintain 30 frames per second (fps) in the game at its highest graphics settings. That led to some wild comparisons as you can see in the chart below. The Intel Arc A770, for example, which is a budget-focused 1080p graphics card, beats the RTX 3080, which was the 4K champion when it launched. Why? The A770 has 16GB of VRAM, while the RTX 3080 has 10GB.

Read more
44 cool tech gifts to impress that special nerd in your life
Arcade1Up arcade cabinets

A quick scroll through big retailers like Amazon or Target will show you a large variety of cool tech gifts discounted for this holiday season -- a frankly overwhelming amount, in fact. Sifting through all the smartwatches, phones, tablets, and appliances yourself to find that cool tech gift for your uncle would take a lot of time that you probably don't have this holiday season.

Luckily, we're here to run down 44 cool tech gadgets that make awesome gifts for the holiday season, all of which are sure to impress your friends and loved ones this holiday. In putting this piece together, we threw in the usual tech items that you expect, like gaming equipment, headphones, and tablets, but also some off-the-wall offerings that you probably wouldn't think to buy for yourself. (After all, isn't that the point of gift-giving?) No matter what kind of tech gift you're looking for, we've got you covered.

Read more
The 40 best tech gifts under $100 for the one who has everything
A burning Solo Stove

Every year, it's inevitable that you run into the same problem: What do you get for your friend or relative that already has everything or that you simply don't know very well? Tech gifts under $100 are a great safe option -- after all, most of us could use a new keyboard or pair of headphones, even if our old ones are still working reasonably well.

In crafting the list below, we scoured our favorite retailers to find a variety of great items in a variety of styles and price points. We included plenty of tech gifts under $100 you'll definitely expect, like speakers, earbuds, and lots of gaming gear, but we also threw in some products you might not ever find for yourself, like a smokeless fire pit, or even a blender. The bottom line is this: scroll down below, and you'll be able to find a tech gift under $100 idea for even the most difficult-to-shop-for friend.

Read more