Machine-learning system aggregates knowledge by surfing the web for information

Here in 2016, we have a data problem, but it is far from the one people faced in previous decades. Instead of a dearth of information, users today face simply too much of it, and distilling it into one manageable place has become a necessity.

That is the challenge researchers at the Massachusetts Institute of Technology set out to solve with a new piece of work, which won the “best paper” award at the Association for Computational Linguistics’ Conference on Empirical Methods in Natural Language Processing in November.

The work seeks to turn conventional machine-learning techniques upside down by offering a new approach to information extraction — which allows an AI system to turn plain text into data for statistical analysis and improve its performance by surfing the web for answers.

“This method is similar to the way that we as humans search for and find information,” Karthik Narasimhan, a graduate student at MIT’s Department of Electrical Engineering and Computer Science, told Digital Trends. “For example, if I find an article with a reference I can’t understand, I know that to understand it I need more training. Since I have access to other articles on the same topic, I’d perform a web search to get additional information from different sources to gain a more informed understanding. We want to do the same thing in an automated scenario.”

MIT’s machine-learning system works by assigning each piece of information it extracts a measure of statistical likelihood. If it determines that it has low confidence about a piece of knowledge, it can automatically generate an internet search query to find other texts to fill in the blanks. If it concludes that a particular document is not relevant, it will move on to the next one. Ultimately, it will extract the best pieces of information and merge them together.
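To make that loop concrete, here is a minimal Python sketch of the idea, not the researchers' code. Every name in it (`Extraction`, `resolve_field`, the `toy_*` helpers, the 0.8 threshold) is a hypothetical stand-in, and the "web search" is just a small in-memory list of sentences.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Optional


@dataclass
class Extraction:
    value: str
    confidence: float  # estimated likelihood that the value is correct, in [0, 1]


def resolve_field(
    field: str,
    initial: Extraction,
    search: Callable[[str], Iterable[str]],
    is_relevant: Callable[[str], bool],
    extract: Callable[[str, str], Optional[Extraction]],
    threshold: float = 0.8,  # assumed cutoff, chosen only for illustration
) -> Extraction:
    """Keep the most confident value for `field`, consulting more texts if needed."""
    best = initial
    if best.confidence >= threshold:
        return best  # confident enough: no search needed
    for doc in search(field):  # low confidence: look for other texts
        if not is_relevant(doc):
            continue  # irrelevant document: move on to the next one
        candidate = extract(doc, field)
        if candidate is not None and candidate.confidence > best.confidence:
            best = candidate  # keep the better piece of information
        if best.confidence >= threshold:
            break
    return best


# Toy stand-ins (all hypothetical) for a search engine and an extractor.
CORPUS = [
    "Weather report: sunny with light winds.",
    "Officials confirmed the recall covers 12 states.",
]


def toy_search(field: str) -> Iterable[str]:
    return CORPUS


def toy_is_relevant(doc: str) -> bool:
    return "recall" in doc


def toy_extract(doc: str, field: str) -> Optional[Extraction]:
    for token in doc.split():
        if token.isdigit():
            return Extraction(token, 0.9)
    return None


result = resolve_field(
    "states_affected", Extraction("unknown", 0.3),
    toy_search, toy_is_relevant, toy_extract,
)
print(result)
```

In this sketch the merge step is simply "keep the most confident value." The real system learns how to merge, as Narasimhan describes below.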

The system was trained on two extraction tasks: compiling information on mass shootings in the U.S., as part of a potential study on the effects of gun control, and compiling information on food contamination. In each scenario, the system was trained on around 300 documents and instructed to extract information answering a number of queries, which it did successfully.

“We used a technique called reinforcement learning, whereby a system learns through the notion of reward,” Narasimhan said. “Because there is a lot of uncertainty in the data being merged — particularly where there is contrasting information — we give it rewards based on the accuracy of the data extraction. By performing this action on the training data we provided, the system learns to be able to merge different predictions in an optimal manner, so we can get the accurate answers we seek.”
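As a rough illustration of learning from reward, the Python sketch below trains a tiny lookup table to decide whether to keep the current value or replace it with a newly found one. It is a deliberately simplified stand-in and not the model from the paper: the training examples, the single yes/no state ("is the new source more confident?"), and the learning settings are all assumptions made for the example.

```python
import random

ACTIONS = ("keep", "replace")

# Hypothetical training examples:
# (current value, current confidence, new value, new confidence, gold answer)
TRAINING = [
    ("4", 0.4, "6", 0.9, "6"),
    ("6", 0.8, "5", 0.3, "6"),
    ("2", 0.2, "3", 0.7, "3"),
    ("9", 0.9, "8", 0.5, "9"),
]


def state_of(cur_conf: float, new_conf: float) -> bool:
    """Collapse the situation into one feature: is the new source more confident?"""
    return new_conf > cur_conf


def train(examples, episodes=500, alpha=0.1, epsilon=0.2, seed=0):
    """Learn action values from rewards with an epsilon-greedy policy."""
    rng = random.Random(seed)
    q = {s: {a: 0.0 for a in ACTIONS} for s in (True, False)}
    for _ in range(episodes):
        cur, cur_conf, new, new_conf, gold = rng.choice(examples)
        s = state_of(cur_conf, new_conf)
        if rng.random() < epsilon:
            action = rng.choice(ACTIONS)  # explore
        else:
            action = max(ACTIONS, key=lambda a: q[s][a])  # exploit
        merged = new if action == "replace" else cur
        reward = 1.0 if merged == gold else 0.0  # reward = extraction accuracy
        q[s][action] += alpha * (reward - q[s][action])  # move estimate toward reward
    return q


def best_action(q, cur_conf: float, new_conf: float) -> str:
    s = state_of(cur_conf, new_conf)
    return max(ACTIONS, key=lambda a: q[s][a])


q_table = train(TRAINING)
print(best_action(q_table, 0.3, 0.8))
print(best_action(q_table, 0.9, 0.4))
```

Because the toy data always rewards the more confident source, the table ends up preferring exactly that. With messier, contradictory data, the same reward signal is what would let a system learn a less obvious merging rule.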

Going forward, Narasimhan said that the research could have myriad applications. For instance, it could be used to scan various news reports and compile a single fact-heavy document, combining data from multiple sources.

It could equally be used in the medical profession. “This could be a great tool for aggregating patient histories,” he said. “In cases where a lot of doctors write different things about treatments a patient has gone through — and each has a different way of writing about it — this technology could be used to distill that information into a more structured database. The result could mean that doctors are able to make better, more informed decisions about a patient.”

Just another exciting, groundbreaking day in the world of machine learning!
