Social media data is riddled with 'human behaviour errors'

Faulty barometer

Sun 30 Nov 2014 // 11:03 UTC

Researchers who rely heavily on social media data when studying human behaviour have been warned that such information can be very easily skewed.

Computer scientists at McGill University in Montreal and Carnegie Mellon University in Pittsburgh said in a paper published yesterday in the journal Science that academics were failing to spot the flaws in the data.

And yet, in recent years, there has been an explosion of studies on human behaviour using social media as a barometer for all kinds of predictions about the world we live in now.

"Many of these papers are used to inform and justify decisions and investments among the public and in industry and government," said Derek Ruths, an assistant professor of computer science at McGill.

He added: "The common thread in all these issues is the need for researchers to be more acutely aware of what they're actually analysing when working with social media data."

The boffins offered up a list of "challenges" faced by researchers who glean their statistics from social media data.

Different social media platforms attract different users – Pinterest, for example, is dominated by females aged 25-34 – yet researchers rarely correct for the distorted picture these populations can produce.

Publicly available data feeds used in social media research don't always provide an accurate representation of the platform's overall data – and researchers are generally in the dark about when and how social media providers filter their data streams.

The design of social media platforms can dictate how users behave and, therefore, what behaviour can be measured. For instance, on Facebook the absence of a "dislike" button makes negative responses to content harder to detect than positive "likes".

Large numbers of spammers and bots, which masquerade as normal users on social media, get mistakenly incorporated into many measurements and predictions of human behaviour.

Researchers often report results for groups of easy-to-classify users, topics, and events, making new methods seem more accurate than they actually are. For instance, efforts to infer political orientation of Twitter users achieve barely 65 per cent accuracy for typical users – even though studies (focusing on politically active users) have claimed 90 per cent accuracy.
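To make the first challenge on the list concrete, here is a minimal sketch of one standard way to correct for a lopsided user base: post-stratification, where each demographic group's average is weighted by its share of the real population rather than its share of the sample. This is not the method from the Science paper. The function name, group labels, and every number below are hypothetical and chosen purely for illustration.

```python
from collections import defaultdict


def poststratified_mean(samples, population_shares):
    """Reweight per-group averages by each group's share of the target population.

    samples: list of (group, value) pairs observed on the platform.
    population_shares: dict mapping group -> share of the real population
        (assumed to sum to 1).
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for group, value in samples:
        totals[group] += value
        counts[group] += 1

    # A group with no observations cannot be reweighted, so fail loudly.
    missing = [g for g in population_shares if counts[g] == 0]
    if missing:
        raise ValueError(f"no samples for groups: {missing}")

    return sum(
        share * totals[group] / counts[group]
        for group, share in population_shares.items()
    )


# Hypothetical data: 1 means the user liked a product, 0 means they did not.
# Eight of the ten sampled users come from a single demographic group.
samples = [("women_25_34", 1)] * 8 + [("everyone_else", 0)] * 2

# Hypothetical shares of the wider population (made up for this example).
shares = {"women_25_34": 0.1, "everyone_else": 0.9}

raw = sum(value for _, value in samples) / len(samples)
weighted = poststratified_mean(samples, shares)
print(f"raw estimate: {raw:.2f}, reweighted estimate: {weighted:.2f}")
```

In this toy case the uncorrected figure suggests most people like the product, while the reweighted figure says hardly anyone does. Reweighting only addresses the population-skew item; it does nothing about filtered feeds, platform design effects, or bots.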

Despite the blindingly obvious weaknesses found in such data, Ruths remained optimistic about researchers using social media in their studies, if they tackle the problems outlined by the prof and his colleagues.
