Feeds

Gone in 30 minutes: Chinese tweets purged by army of censors

New report claims thousands of censors could be working for Sina Weibo

Gartner critical capabilities for enterprise endpoint backup

The murky world of online self-censorship in China has come under the spotlight again in a new report which estimates that most post deletions on the Twitter-like Sina Weibo occur within the first 30 minutes of appearing.

The Velocity of Censorship: High-Fidelity Detection of Microblog Post Deletions, was researched by academics at Bowdoin College, Rice University and the University of New Mexico alongside independent researcher Tao Zhu(h/t MIT Technology Review).

Sina claims its service has over 500 million users, but for the purposes of this research the team concentrated on the posts of around 3,500 “sensitive” users with a track record of censorship.

Developing a system “which collects removed posts on targeted users in almost real time”, the researchers found that roughly 12 per cent of posts were deleted over the 15 day monitoring period – which amounts to more than 4,500 every day.

The research included the following observation(PDF):

Our research found that deletions happen most heavily in the first hour after a post has been made. Especially for original posts that are not reposts, most deletions occur within 5-30 minutes, accounting for 25 per cent of the total deletions of such posts. Nearly 90 per cent of the deletions of such posts happen within the first 24 hours of the post.

To enable such speedy censorship, the report claims a mixture of technical and non-technical filtering is used, with potentially thousands of staff employed to eyeball content, as per the following hypothesis:

The deletions happen most heavily for a regular post within 5 to 10 minutes of it being posted. Suppose an efficient worker can read 50 posts per minute, including the reposts and figures included in the posts. Then to read Weibo’s full 70,000 new posts in one minute, 1,400 workers working at the same time would be needed. If these workers only worked in 8 hour shifts, 4,200 workers would then be required.

Proactive keyword filtering blocks certain posts before they have gone live, or holds them for human review, while a range of retroactive mechanisms including backwards keyword and repost searches, public timeline filtering and monitoring of specific censorship-prone individuals were also highlighted in the report.

The research also hypothesises that the censors work “relatively independently, in a distributed fashion”, with activity only really dipping between around 1-7am and again slightly at 7pm – which the report authors claim could be due to the national TV news programme broadcast at that time.

Although the report casts new light on the speed and accuracy of China’s web censors, it doesn’t explain why more isn’t done to block potentially illegal content before it is even posted.

One possible answer came from Sina Weibo manager @geniune_Yu_Yang (正版于洋), who – apparently frustrated by user anger directed at the company’s army of censors - wrote an illuminating post of his own back in January.

He effectively argued that Sina is trying to work around the strict regulations forced upon it by government, by at least letting users see and disseminate their content for a few minutes before it is deleted.

He wrote:

You can see the messages before they are deleted, right? You still have your account functioning, right? You are all experienced netizens, you know that the technology allows us to delete messages in a second. Please think carefully on this.

Now, there is no way of proving whether this manager was engaging in a crafty piece of well-timed PR or if there’s some truth to his claims.

Somewhat ironically, his post too was deleted, which illustrates perfectly the central problem with censorship of this kind: there's no way of telling whether a piece of content is deleted because it was true, or because it wasn't. ®

Boost IT visibility and business value

More from The Register

next story
Munich considers dumping Linux for ... GULP ... Windows!
Give a penguinista a hug, the Outlook's not good for open source's poster child
UK fuzz want PINCODES on ALL mobile phones
Met Police calls for mandatory passwords on all new mobes
e-Borders fiasco: Brits stung for £224m after US IT giant sues UK govt
Defeat to Raytheon branded 'catastrophic result'
Yes, but what are your plans if a DRAGON attacks?
Local UK gov outs most ridiculous FoI requests...
Detroit losing MILLIONS because it buys CHEAP BATTERIES – report
Man at hardware store was right: name brands DO last longer
Snowden on NSA's MonsterMind TERROR: It may trigger cyberwar
Plus: Syria's internet going down? That was a US cock-up
UK government accused of hiding TRUTH about Universal Credit fiasco
'Reset rating keeps secrets on one-dole-to-rule-them-all plan', say MPs
Caught red-handed: UK cops, PCSOs, specials behaving badly… on social media
No Mr Fuzz, don't ask a crime victim to be your pal on Facebook
EU justice chief blasts Google on 'right to be forgotten'
Don't pretend it's a freedom of speech issue – interim commish
prev story

Whitepapers

5 things you didn’t know about cloud backup
IT departments are embracing cloud backup, but there’s a lot you need to know before choosing a service provider. Learn all the critical things you need to know.
Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Build a business case: developing custom apps
Learn how to maximize the value of custom applications by accelerating and simplifying their development.
Rethinking backup and recovery in the modern data center
Combining intelligence, operational analytics, and automation to enable efficient, data-driven IT organizations using the HP ABR approach.
Next gen security for virtualised datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.