Skip to main content

Gender can be accurately guessed from a single Tweet

twitter
Image used with permission by copyright holder

Researchers with the Mitre corporation have developed a method to accurately guess gender by isolating specific words in a tweet. Twitter does not collect gender within profiles making this a perfect testing ground for the algorithm. The team first collected the  location, description, profile name and real name of all Twitter users in the sample. Most of the Twitter users in the sample size has only posted one time on the social service. An opening test was to see if the algorithm could detect a person’s gender from the name and the computer was able to guess correctly 89 percent of the time.

By analyzing only the content of a single tweet, the algorithm was able to guess gender correctly nearly 66 percent of the time. Analyzing all the tweets in a user’s stream increased accuracy to a bit over 75 percent. Other results included about 71 percent accuracy on just the description and 77 percent accuracy on the screen name. When combining all four fields with the tweets, the computer had a 92 percent accuracy rating.

tweetPunctuation often popped up as an indication of gender. Usage of a smiley face or an exclamation point typically indicated that the gender is female. Females are also more likely to use words like “love”, “cute”, “happy”, “mommy”, “sleep”, “school”, “baby”, “bed”, “chocolate” and “hate” as well as Internet slang like “LOL” and “OMG”. Males only had a couple phrases attributed to them including “http” and “google”.

The study also showed clear gender lines for “possessive bigrams”, a phrase that starts with “my” or “our”. Phrases attributed to males included “my wife”, “my gf” and “my beer”.  Females most commonly used “my yogurt” and “my husband”. These phrases were also analyzed to identify political identification. Tweets about yoga, vegetarians and the Los Angeles Lakers are most likely to come from Democrats while tweets about Walmart, weapons and LSU are most likely to come from Republicans.

This algorithm would be useful to anyone attempting to reach a specific audience on Twitter, namely brands and businesses attempting to market themselves to the Twitter audience.

Editors' Recommendations

Mike Flacy
By day, I'm the content and social media manager for High-Def Digest, Steve's Digicams and The CheckOut on Ben's Bargains…
WhatsApp now lets you send self-destructing voice messages
WhatsApp logo on a phone.

If you’re on WhatsApp and regularly make use of the view once feature for photo and video messages, then you might be interested to learn that the feature has now been expanded to voice messages.

WhatsApp’s view once feature does what it says, deleting a message after it’s been viewed a single time. It’s been available for photos and videos since 2021, but now you can also send voice messages that can only be played once before they, too, disappear from the app.

Read more
X rival Threads could be about to get millions of more users
Instagram Threads app.

Threads -- Meta’s rival to X, formerly Twitter -- has just launched in the European Union (EU), a market with nearly half a billion people.

The app launched in the U.S. to much fanfare in July, with Meta hoping to attract X users disillusioned with the turbulence on the platform since Elon Musk acquired it for $44 billion 14 months ago.

Read more
X (formerly Twitter) returns after global outage
A white X on a black background, which could be Twitter's new logo.

X, formerly known as Twitter, went down for about 90 minutes for users worldwide early on Thursday ET.

Anyone opening the social media app across all platforms was met with a blank timeline. On desktop, users saw a message that simply read, "Welcome to X," while on mobile the app showed suggestions for accounts to follow.

Read more