Skip to main content

Current tech for detecting hate speech is woefully inadequate, researchers find

Exactly what constitutes hate speech is one of the most hotly contested topics in 2018. One of the reasons it is so vigorously debated is because of how difficult it can be to define. If humans find hate speech difficult to define, machines find it even more more of a chore, as a new survey of seven different computer systems intended to identify such online speech makes clear. It also touches on just how easy they are to circumvent.

Researchers from Finland’s Aalto University analyzed various anti-hate speech systems, including tools built by Google’s Counter Abuse team. Their findings? Not only can the systems used to flag offensive content online not agree on a solid definition for hate speech, they can also be easily fooled with little more than a typo or letter substitution.

“Researchers and companies have suggested various text analysis and machine learning methods for automatic hate speech detection,” Gröndahl Tommi, one of the researchers on the project, told Digital Trends. “These systems are trained with examples of hateful and non-hateful text, with the goal of generalizing beyond the training examples. We applied a system trained with one data set to other data sets. We discovered that none of them worked well on other data sets. This indicates that what is called ‘hate speech’ differs a lot between existing data sets, and cannot be treated as a clearly definable property. Given this, we should not expect A.I. to replace humans completely in this task, as human labor continues to be required to make the final decisions on what constitutes hate speech proper.”

The researchers next demonstrated how all seven systems could be easily fooled by simple automatic text transformation attacks — such as making small changes to words, introducing or removing spaces, or adding unrelated words. For example, adding the word “love” into an otherwise hate-filled message confuses detection systems. These tricks were capable of fooling both straightforward keyword filters and more complex A.I. systems, based on deep-learning neural network architectures.

That today’s flagging tools are inadequate for dealing with online hate speech is no great shock. While we’ve covered some innovative cutting-edge projects in this domain, research such as this reveals just how much more work there is to do Hopefully, projects like this one will make researchers double down on the challenge, and not throw up their hands in defeat.

Editors' Recommendations

Luke Dormehl
I'm a UK-based tech writer covering Cool Tech at Digital Trends. I've also written for Fast Company, Wired, the Guardian…
Twitter finally banned hate speech against religious groups. Will it help?

Twitter will expand its rules to ban hateful conduct made against religious groups, the social network announced Tuesday.

The new rules, announced by Twitter Safety in a blog post, will require the company to remove any tweet which “dehumanizes whole religious groups.” 

Read more
Facebook launches new changes against hate and discrimination. Are they enough?
Facebook

Amid scandals over ad discrimination and hate speech, Facebook is launching a series of changes. Civil rights leader Laura Murphy recently finished the company’s second civil rights audit, and referred to the changes as a “systematic, cross-functional framework to address these issues over time.” Critics, however, have already voiced concerns that the platform isn’t doing enough to tackle the issues.

The report, the second following an initial report in December 2018, focuses on the social network’s enforcement against hate speech, discrimination in ads, and tackling of misinformation. A third and final report is expected to be released in early 2020. As part of the report, Murphy talked with more than 90 civil rights organizations, as well as Facebook leaders and policy teams. The report both identifies the changes Facebook is making and areas for further improvement.

Read more
Digital Trends’ Top Tech of CES 2023 Awards
Best of CES 2023 Awards Our Top Tech from the Show Feature

Let there be no doubt: CES isn’t just alive in 2023; it’s thriving. Take one glance at the taxi gridlock outside the Las Vegas Convention Center and it’s evident that two quiet COVID years didn’t kill the world’s desire for an overcrowded in-person tech extravaganza -- they just built up a ravenous demand.

From VR to AI, eVTOLs and QD-OLED, the acronyms were flying and fresh technologies populated every corner of the show floor, and even the parking lot. So naturally, we poked, prodded, and tried on everything we could. They weren’t all revolutionary. But they didn’t have to be. We’ve watched enough waves of “game-changing” technologies that never quite arrive to know that sometimes it’s the little tweaks that really count.

Read more