Feeds

ICO orders release of (mostly useless) weather station data

CRU cuts of weather datasets released into the wild

Build a business case: developing custom apps

The Information Commissioner's Office has ordered the University of East Anglia to release a portion of a weather dataset. The University's Climatic Research Unit had shared the data with Georgia Tech but refused to release it more widely. A leading Oxford physicist, Professor Jonathan Jones, made the successful request, which the ICO has now published.

The CRUTEM data set contains gridded weather station data provided by over 100 national meteorological services. Although CRU shared at least part of the data set with sympathetic academics, it had refused it to Jones and others in 2009, claiming several justifications. One justification that it was already available. Others were that the data wasn't already available but that making it available would breach copyright, harm the suppliers and jeopardise international relations.

The ICO rejected all grounds and ordered it to be disclosed to Professor Jones.

East Anglia had claimed that CRUTEM station records and the collection of records – the database – were both covered by copyright, the latter with a term of 15 years. The UEA said it would harm its ability to commercially exploit the database. The ICO accepted that the copyright existed and pointed out that DEFRA guidelines should not prevent the data being disclosed. It added that CRU had been unable to demonstrate harm, or potential harm.

Does it matter? Not so much directly, one of the requesters explained last year. According to climate blogger Steve McIntyre, the real story was in the prolonged cover-up, and the improbable justifications, rather than anything nefarious in the code itself.

"... CRUTEM was an almost microscopically small issue in the Climategate emails – Climategate was about the Hockey Stick and its handling by IPCC, not CRUTEM. CRUTEM was mentioned in only 25 emails and, even then, often passim," explained McIntyre last year.

"My long-standing position on CRUTEM was that CRU's obstruction of data requests was most likely due to its desire to conceal that it did so little work on quality control; that the CRU result could be derived so trivially that, in effect, CRU no longer served any useful function in this field.

"The criticism from Climate Audit was that (1) CRU provided their station data as collated to 'friends' but not to potential critics; and (2) that their excuses for not providing station data were what one London reporter (not Jonathan Leake) described to me as 'deliberately deceptive'."

Dodgy science – but great literature

McIntyre's point about "negligiable quality control" was borne out in the famous HARRY_READ_ME.TXT file, a 90,000-word document recording attempts to update CRU's "flagship" product. Here a few choice excerpts:


"So.. we don't have the coefficients files (just .eps plots of something). But what are all those monthly files? DON'T KNOW, UNDOCUMENTED. Wherever I look, there are data files, no info about what they are other than their names. And that's useless..."

"Then got a mail from PJ [CRU director Phil Jones] to say we shouldn't be excluding stations inside 8km anyway – yet that's in IJC – Mitchell & Jones 2005! So there you go."


"[G]etting seriously fed up with the state of the Australian data. so many new stations have been introduced, so many false references ... so many changes that aren't documented. Every time a cloud forms I'm presented with a bewildering selection of similar-sounding sites, some with references, some with WMO codes, and some with both. And if I look up the station metadata with one of the local references, chances are the WMO code will be wrong (another station will have it) and the lat/lon will be wrong too."


"I am very sorry to report that the rest of the databases seem to be in nearly as poor a state as Australia was. There are hundreds if not thousands of pairs of dummy stations, one with no WMO and one with, usually overlapping and with the same station name and very similar coordinates. I know it could be old and new stations, but why such large overlaps if that's the case? Aarrggghhh! There truly is no end in sight."
"I am seriously worried that our flagship gridded data product is produced by Delaunay triangulation – apparently linear as well. As far as I can see, this renders the station counts totally meaningless. It also means that we cannot say exactly how the gridded data is arrived at from a statistical perspective – since we're using an off-the-shelf product that isn't documented sufficiently to say that ... Of course, it's too late for me to fix it too. Meh."

"OH FUCK THIS. It's Sunday evening, I've worked all weekend, and just when I thought it was done I'm hitting yet another problem that's based on the hopeless state of our databases. There is no uniform data integrity, it's just a catalogue of issues that continues to grow as they're found."

In Parliament's enquiries into the Climategate Affair, Graham Stringer MP was surprised to learn that the CRU team couldn't produce the same result twice.

"When I asked Oxburgh if [Keith] Briffa [CRU academic] could reproduce his own results, he said in lots of cases he couldn't," Stringer told us. "That just isn't science. It's literature. If somebody can't reproduce their own results, and nobody else can, then what is that work doing in the scientific journals?" ®

Boost IT visibility and business value

More from The Register

next story
Just TWO climate committee MPs contradict IPCC: The two with SCIENCE degrees
'Greenhouse effect is real, but as for the rest of it ...'
Asteroid's DINO KILLING SPREE just bad luck – boffins
Sauricide WASN'T inevitable, reckon scientists
Flamewars in SPAAACE: cooler fires hint at energy efficiency
Experiment aboard ISS shows we should all chill out for cleaner engines
Brit amateur payload set to complete full circle around PLANET EARTH
Ultralight solar radio tracker in glorious 25,000km almost-space odyssey
NASA Mars rover FINALLY equals 1973 Soviet benchmark
Yet to surpass ancient Greek one, however
Famous 'Dish' radio telescope to be emptied in budget crisis: CSIRO
Radio astronomy suffering to protect Square Kilometre Array
BEST BATTERY EVER: All lithium, all the time, plus a dash of carbon nano-stuff
We have found the Holy Grail (of batteries) - boffins
prev story

Whitepapers

Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Boost IT visibility and business value
How building a great service catalog relieves pressure points and demonstrates the value of IT service management.
Why and how to choose the right cloud vendor
The benefits of cloud-based storage in your processes. Eliminate onsite, disk-based backup and archiving in favor of cloud-based data protection.
The Essential Guide to IT Transformation
ServiceNow discusses three IT transformations that can help CIO's automate IT services to transform IT and the enterprise.
Maximize storage efficiency across the enterprise
The HP StoreOnce backup solution offers highly flexible, centrally managed, and highly efficient data protection for any enterprise.