Current tech for detecting hate speech is woefully inadequate, researchers find

Exactly what constitutes hate speech is one of the most hotly contested topics in 2018. One reason it is so vigorously debated is that it is so difficult to define. If humans find hate speech difficult to define, machines find it even more of a chore, as a new survey of seven different computer systems intended to identify such online speech makes clear. The survey also shows just how easy those systems are to circumvent.

Researchers from Finland’s Aalto University analyzed various anti-hate speech systems, including tools built by Google’s Counter Abuse team. Their findings? Not only can the systems used to flag offensive content online not agree on a solid definition for hate speech, they can also be easily fooled with little more than a typo or letter substitution.

“Researchers and companies have suggested various text analysis and machine learning methods for automatic hate speech detection,” Tommi Gröndahl, one of the researchers on the project, told Digital Trends. “These systems are trained with examples of hateful and non-hateful text, with the goal of generalizing beyond the training examples. We applied a system trained with one data set to other data sets. We discovered that none of them worked well on other data sets. This indicates that what is called ‘hate speech’ differs a lot between existing data sets, and cannot be treated as a clearly definable property. Given this, we should not expect A.I. to replace humans completely in this task, as human labor continues to be required to make the final decisions on what constitutes hate speech proper.”

The researchers next demonstrated how all seven systems could be easily fooled by simple automatic text transformation attacks — such as making small changes to words, introducing or removing spaces, or adding unrelated words. For example, adding the word “love” into an otherwise hate-filled message confuses detection systems. These tricks were capable of fooling both straightforward keyword filters and more complex A.I. systems, based on deep-learning neural network architectures.
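To make these evasion tricks concrete, here is a minimal Python sketch. It is not the researchers' code or any of the seven systems they tested: the blocklist, the word weights, the threshold, and the placeholder term "badword" are all hypothetical, chosen only to illustrate how letter substitution, space removal, and an added word like "love" can slip past a naive keyword filter or a toy word-scoring classifier.

```python
import re

# Hypothetical blocklist; "badword" is a stand-in for an offensive term.
BLOCKLIST = {"badword"}

# Hypothetical per-word weights for a toy scoring classifier.
WEIGHTS = {"badword": 1.0, "love": -1.0}
THRESHOLD = 0.5  # assumed cut-off: scores above this are flagged


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def keyword_filter(text):
    """Flag the text if any token exactly matches a blocklisted word."""
    return any(token in BLOCKLIST for token in tokenize(text))


def score_classifier(text):
    """Flag the text if the summed word weights exceed the threshold."""
    score = sum(WEIGHTS.get(token, 0.0) for token in tokenize(text))
    return score > THRESHOLD


def substitute_letters(text):
    """Replace some letters with look-alike digits ('a' -> '4', 'o' -> '0')."""
    return text.translate(str.maketrans({"a": "4", "o": "0"}))


def remove_spaces(text):
    """Delete all spaces so word boundaries disappear."""
    return text.replace(" ", "")


def append_word(text, word="love"):
    """Append an unrelated, positive-sounding word to the message."""
    return f"{text} {word}"


if __name__ == "__main__":
    message = "you are a badword"
    variants = (
        message,
        substitute_letters(message),
        remove_spaces(message),
        append_word(message),
    )
    for variant in variants:
        print(repr(variant), keyword_filter(variant), score_classifier(variant))
```

In this toy setup, letter substitution and space removal defeat both detectors because the offending token no longer matches anything they know. Appending "love" leaves the keyword filter unaffected but pulls the toy classifier's score back under the threshold, which loosely mirrors the effect described above. Real systems are far more complex, so treat this purely as an illustration.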

That today’s flagging tools are inadequate for dealing with online hate speech is no great shock. While we’ve covered some innovative cutting-edge projects in this domain, research such as this reveals just how much more work there is to do. Hopefully, projects like this one will make researchers double down on the challenge, and not throw up their hands in defeat.

Luke Dormehl