Facebook's text understanding engine is reading thousands of posts every second

Facebook has unveiled a new piece of technology that’s either impressive or invasive depending on how you look at it. DeepText is a “text understanding engine” that can apparently read and understand posts with near-human accuracy.

A blog post published on the Facebook Code subsite stresses the enormous importance of text-based communication to the social media giant. The company’s ability to recommend relevant content and filter out spam will reportedly be vastly improved thanks to the abilities of DeepText.

Using a complex system of neural networks and deep-learning techniques, DeepText is able to understand the textual content of posts written in more than 20 languages at a rate of several thousand posts per second.

Poring over Facebook posts might not seem like the most challenging task, but difficulties arise for nonhuman readers when context comes into play. Previous attempts to create a system like DeepText were stymied by confusing elements like ambiguous terminology and slang.

Traditional methods of “teaching” a machine to read centered on assigning each word a numerical ID that a computer can easily recognize. However, this strategy only works if words are written exactly as they appear in the teaching materials; an unfamiliar spelling or a slang variant simply has no ID to match.

By contrast, DeepText uses a system that preserves semantic links between different terms. This means that words like “brother” and “bro” are connected, as are different representations of the same term in different languages.
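To make the difference concrete, here is a minimal Python sketch of the two approaches. It is an illustration only, not Facebook's code: the vocabulary, the three-dimensional vectors, and the helper names `lookup_id`, `cosine_similarity`, and `nearest_word` are all hypothetical, and the vectors are hand-picked rather than learned from data as they would be in a real system.

```python
import math

# Exact-match approach: each known spelling maps to an integer ID.
# The vocabulary here is a made-up example.
WORD_IDS = {"brother": 0, "sister": 1, "taxi": 2}


def lookup_id(word):
    """Return the ID for a word, or None if this exact spelling was never seen."""
    return WORD_IDS.get(word)


# Vector approach: each word maps to a point in space, and related words
# sit close together. These toy vectors are hand-picked for illustration;
# a real system would learn them from large amounts of text.
EMBEDDINGS = {
    "brother": [0.90, 0.10, 0.00],
    "bro": [0.85, 0.15, 0.05],
    "taxi": [0.00, 0.20, 0.95],
}


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (close to 1.0 means very similar)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def nearest_word(word, candidates):
    """Return the candidate whose vector is most similar to the given word's."""
    target = EMBEDDINGS[word]
    return max(candidates, key=lambda c: cosine_similarity(target, EMBEDDINGS[c]))


if __name__ == "__main__":
    # The ID table has no entry for the slang spelling.
    print(lookup_id("bro"))
    # The vector table still links the slang term to its standard form.
    print(nearest_word("bro", ["brother", "taxi"]))
```

With the ID table, “bro” is simply unknown. With vectors, it lands next to “brother” and far from “taxi”, which is the kind of semantic link the article describes.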

Facebook is an enormous service, and the company faces a big challenge in parsing all the text that users post and surfacing what is relevant to each of them. The DeepText system could be a huge help in separating the wheat from the chaff.

In fact, DeepText is already being implemented in various Facebook services as a test of its capabilities. For instance, the system has been integrated into Messenger in such a way that it can determine when a user is looking to call a cab and offer up a means of doing so, while ignoring any use of words like “ride” that doesn’t refer to a taxi journey.

Development of DeepText is set to continue in collaboration with the Facebook AI Research group.

Brad Jones
Former Digital Trends Contributor
Brad is an English-born writer currently splitting his time between Edinburgh and Pennsylvania. You can find him on Twitter…