The Register® — Biting the hand that feeds IT

Feeds

Worldwide Gmail crash was due to Google Sync bug

Engineer: How Oompah-Loompah config foul-up pulled down world's email

Cloud based data management

It was a Googler misconfiguring a sync server that took out Chrome and Gmail on Monday, an engineer has stated on dev forums.

The reason for Gmail's brief burnout on Monday has been winkled out, and it was connected to the rolling crashes suffered on the Chrome browser on the same day.

It was a human error in configuring a Chrome component that throttled traffic in Chrome and then on several other Google services, explained Chrome engineer Tim Steele in a post on the Google code forum.

The component controlled traffic for Chrome Sync - a service that allows users to synchronise their customised Chrome browser across all of their devices, giving them the same bookmarks, widgets, settings and browsing history.

A small change to its configuration settings meant the load-balancing component started to throttle traffic when it wasn't supposed to. And because the component is core to the infrastructure that many Google services depend on, it affected them too.

It was Chrome Sync users who were hit by the outage first.

Steele explained:

- Chrome Sync Server relies on a backend infrastructure component to enforce quotas on per-datatype sync traffic.
- That quota service experienced traffic problems today due to a faulty load balancing configuration change.
- That change was to a core piece of infrastructure that many services at Google depend on. This means other services may have been affected at the same time, leading to the confounding original title [the Gmail bug] of this bug.
- Because of the quota service failure, Chrome Sync servers reacted too conservatively by telling clients to throttle "all" data types, without accounting for the fact that not all client versions support all data types.

The crash is due to faulty logic responsible for handling "throttled" data types on the client when the data types are unrecognized.

There is WRONG in the cloud. ®

Agentless Backup is Not a Myth

Thankfully I was not affected, as I lost access to my account, and google refuse to help me, insisting instead that they are happy I have gotten access to my account again, and not replying to follow-up mail.

I now run my own mail server, since about a year, that has not let me down yet.

4
1
Anonymous Coward

CORPORATE BROTHERHOOD FAIL: The usual response is:

"We had a small problem with one of our core services that resulted in a brief outage for some of our customers. This is now resolved and we apologise to our users for the inconvenience and would like to reassure them of a return to our normal sterling service."

3
0
Anonymous Coward

Re: i cant believe some cunt gets away with this

i want to know that he was fired and/or killed

I see getting the dosage right is still somewhat of a problem.. Shoot someone for a human error?

2
0

More from The Register

SCO vs. IBM battle resumes over ownership of Unix
Zombie lawsuit back and wants to suck the brains out of Linux
 breaking news
You don't need phone lines or cable for ANYTHING, says Dish
The satellite-dish man can sort you out with phone and broadband over the air too
 breaking news
What's HP got under wraps? Looks awfully flash and tape shaped
What happens in Vegas won't stay there - we've got the details
Microsoft borks botnet takedown in Citadel snafu
Stupid Redmond kicked over our honeypots, wail white hats
IBM's $1bn layoffs latest: Now axe swings in US, Canada - reports
Union claims 121 storage bods canned after dismal sales
NetApp musters muscular cluster bluster for ONTAP busters
Storage array OS overhauled to juggle more nodes, go down on you, er, less
HP adds 'Haswell' Xeon E3s to entry ProLiant servers
Gussies up MicroServer for SMBs, adds baby switches