
Man City drags Big Data into Big Football

And Google+ grabs sweaty ballers for its Hangouts...


Open ... and Shut Football is the world's most popular sport by a crushing margin. Yet for all the money and attention it gets, the beautiful game has remained doggedly anti-technology, eschewing video replays or goal-line technology despite the prevalence of such tools in other sports. One club, however, is opting to make technology a central hallmark of its game, and may well force other clubs to follow suit.

Unfortunately, that club is not my Arsenal.

No, it's Manchester City, the perennial "other club" in Manchester, which in recent years through barrels-full of oil money has bought an all-star team (and a title to go with it). The Citizens (or "Man City"), as the team is colloquially called, have recently set some soccer industry firsts, opening up their player data and more recently establishing intimate fan forums on Google+ Hangouts.

The Google+ Hangouts move is cute but unlikely to move the needle much on Man City's aspiration to retain its Premiership title. After all, only 10 participants can engage with Man City's chosen representatives (this week it's Judas Iscariot… former Arsenal star and now a Man City executive Patrick Vieira). The hangout is streamed live so others can watch, but it remains an interesting, if not necessarily sport-changing, sop to fans.

Man City's Big Data initiative, however, is much more promising.

Presented in .csv format, Man City's data trove is a "time-coded feed that lists all player action events within the game with a player, team, event type, minute and second for each action, together with the x/y/z co-ordinates for each event", thereby enabling "heat map, touch map, passing matrices and mapping attacking play and distribution".
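To give a rough idea of what a hobbyist could do with such a feed, here is a minimal Python sketch that bins one player's events into a coarse grid, the raw material for a touch map or heat map. The column names, the sample rows, the 0-100 coordinate range and the `touch_map` helper are all assumptions made for illustration; the real feed's schema may well differ.

```python
import csv
import io
from collections import Counter

# Hypothetical sample rows. The column names and the 0-100 coordinate
# range are assumptions for illustration, not the real feed's schema.
SAMPLE = """player,team,event_type,minute,second,x,y,z
Player A,Home,pass,0,12,35.0,40.0,0.0
Player B,Home,pass,0,15,55.0,42.0,0.0
Player A,Home,shot,1,3,88.0,50.0,0.0
Player C,Away,tackle,1,40,30.0,20.0,0.0
Player A,Home,pass,2,5,38.0,45.0,0.0
"""


def touch_map(csv_text, player, cells=4, pitch=100.0):
    """Count one player's events per grid cell (a coarse touch map)."""
    counts = Counter()
    for row in csv.DictReader(io.StringIO(csv_text)):
        if row["player"] != player:
            continue
        # Scale each coordinate to a cell index, clamping the far edge
        # so that a value equal to `pitch` lands in the last cell.
        col = min(int(float(row["x"]) / pitch * cells), cells - 1)
        line = min(int(float(row["y"]) / pitch * cells), cells - 1)
        counts[(col, line)] += 1
    return counts


heat = touch_map(SAMPLE, "Player A")
for cell, touches in sorted(heat.items()):
    print(cell, touches)
```

The same loop, keyed on pairs of consecutive players instead of grid cells, would be one way to start on a passing matrix.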

This is the same data that Man City credits with guiding it toward the best defensive record in the Premiership for the past two seasons. (The more cynical among us would argue that no amount of data could have helped the old cash-strapped Man City to that record, but why quibble just because my own club refuses TO SPEND SOME !%!%!% MONEY, ARSENE!!!)

Such data has been available for a fee to others, but hasn't been widely accessible to the public. This has now changed, and will likely spur other clubs to take similar actions.

Given that professionals have already had access to this data, the real value isn't in gifting opposing teams data about one's performance. It's rather a way to reach out to as-yet unknown but potentially useful sources of data crunching. As Gavin Fleig, Man City's head of performance analysis, explains, the decision to open up its player data is a way to enable the "Bill Jameses" of the world – James being the man whose sabermetric work in the 1970s arguably revolutionised baseball:

Bill James kick-started the analytics revolution in baseball. That made a real difference and has become integrated in that sport. Somewhere in the world there is football's Bill James, who has all the skills and wants to use them but hasn't got the data. We want to help find that Bill James, not necessarily for Manchester City but for the benefit of analytics in football. I don't want to be at another analytics conference in five years' time talking to people who would love to analyse the data but cannot develop their own concepts because all the data is not publicly available.

This is a great example of one cardinal principle of Big Data: "Don't throw any data away." Data storage is cheap while analysis of these data is potentially priceless, even (pardon the pun) game-changing.

Personally, I'm hopeful that the Citizens' data will be used by some enterprising Arsenal fan to crack the code on how to spend peanuts and still break at least one Manchester club's stranglehold on the top of football. But whether or not this pipe dream materialises, it's impressive how Man City is challenging soccer's technology status quo, embracing technology as a way to interact with fans and anonymous data scientists. I don't love the team, but I love its technology leadership. ®

Matt Asay is senior vice president of business development at Nodeable, offering systems management for managing and analysing cloud-based data. He was formerly SVP of biz dev at HTML5 start-up Strobe and chief operating officer of Ubuntu commercial operation Canonical. With more than a decade spent in open source, Asay served as Alfresco's general manager for the Americas and vice president of business development, and he helped put Novell on its open source track. Asay is an emeritus board member of the Open Source Initiative (OSI). His column, Open...and Shut, appears three times a week on The Register.
