Feeds

ICO orders release of (mostly useless) weather station data

CRU cuts of weather datasets released into the wild

Secure remote control for conventional and virtual desktops

The Information Commissioner's Office has ordered the University of East Anglia to release a portion of a weather dataset. The University's Climatic Research Unit had shared the data with Georgia Tech but refused to release it more widely. A leading Oxford physicist, Professor Jonathan Jones, made the successful request, which the ICO has now published.

The CRUTEM data set contains gridded weather station data provided by over 100 national meteorological services. Although CRU shared at least part of the data set with sympathetic academics, it had refused it to Jones and others in 2009, claiming several justifications. One justification that it was already available. Others were that the data wasn't already available but that making it available would breach copyright, harm the suppliers and jeopardise international relations.

The ICO rejected all grounds and ordered it to be disclosed to Professor Jones.

East Anglia had claimed that CRUTEM station records and the collection of records – the database – were both covered by copyright, the latter with a term of 15 years. The UEA said it would harm its ability to commercially exploit the database. The ICO accepted that the copyright existed and pointed out that DEFRA guidelines should not prevent the data being disclosed. It added that CRU had been unable to demonstrate harm, or potential harm.

Does it matter? Not so much directly, one of the requesters explained last year. According to climate blogger Steve McIntyre, the real story was in the prolonged cover-up, and the improbable justifications, rather than anything nefarious in the code itself.

"... CRUTEM was an almost microscopically small issue in the Climategate emails – Climategate was about the Hockey Stick and its handling by IPCC, not CRUTEM. CRUTEM was mentioned in only 25 emails and, even then, often passim," explained McIntyre last year.

"My long-standing position on CRUTEM was that CRU's obstruction of data requests was most likely due to its desire to conceal that it did so little work on quality control; that the CRU result could be derived so trivially that, in effect, CRU no longer served any useful function in this field.

"The criticism from Climate Audit was that (1) CRU provided their station data as collated to 'friends' but not to potential critics; and (2) that their excuses for not providing station data were what one London reporter (not Jonathan Leake) described to me as 'deliberately deceptive'."

Dodgy science – but great literature

McIntyre's point about "negligiable quality control" was borne out in the famous HARRY_READ_ME.TXT file, a 90,000-word document recording attempts to update CRU's "flagship" product. Here a few choice excerpts:


"So.. we don't have the coefficients files (just .eps plots of something). But what are all those monthly files? DON'T KNOW, UNDOCUMENTED. Wherever I look, there are data files, no info about what they are other than their names. And that's useless..."

"Then got a mail from PJ [CRU director Phil Jones] to say we shouldn't be excluding stations inside 8km anyway – yet that's in IJC – Mitchell & Jones 2005! So there you go."


"[G]etting seriously fed up with the state of the Australian data. so many new stations have been introduced, so many false references ... so many changes that aren't documented. Every time a cloud forms I'm presented with a bewildering selection of similar-sounding sites, some with references, some with WMO codes, and some with both. And if I look up the station metadata with one of the local references, chances are the WMO code will be wrong (another station will have it) and the lat/lon will be wrong too."


"I am very sorry to report that the rest of the databases seem to be in nearly as poor a state as Australia was. There are hundreds if not thousands of pairs of dummy stations, one with no WMO and one with, usually overlapping and with the same station name and very similar coordinates. I know it could be old and new stations, but why such large overlaps if that's the case? Aarrggghhh! There truly is no end in sight."
"I am seriously worried that our flagship gridded data product is produced by Delaunay triangulation – apparently linear as well. As far as I can see, this renders the station counts totally meaningless. It also means that we cannot say exactly how the gridded data is arrived at from a statistical perspective – since we're using an off-the-shelf product that isn't documented sufficiently to say that ... Of course, it's too late for me to fix it too. Meh."

"OH FUCK THIS. It's Sunday evening, I've worked all weekend, and just when I thought it was done I'm hitting yet another problem that's based on the hopeless state of our databases. There is no uniform data integrity, it's just a catalogue of issues that continues to grow as they're found."

In Parliament's enquiries into the Climategate Affair, Graham Stringer MP was surprised to learn that the CRU team couldn't produce the same result twice.

"When I asked Oxburgh if [Keith] Briffa [CRU academic] could reproduce his own results, he said in lots of cases he couldn't," Stringer told us. "That just isn't science. It's literature. If somebody can't reproduce their own results, and nobody else can, then what is that work doing in the scientific journals?" ®

Next gen security for virtualised datacentres

More from The Register

next story
Vulture 2 takes a battering in 100km/h test run
Still in one piece, but we're going to need MORE POWER
TRIANGULAR orbits will help Rosetta to get up close with Comet 67P
Probe will be just 10km from Space Duck in October
Gigantic toothless 'DRAGONS' dominated Earth's early skies
Gummy pterosaurs outlived toothy competitors
Boffins ID freakish spine-smothered prehistoric critter: The CLAW gave it away
Bizarre-looking creature actually related to velvet worms
CRR-CRRRK, beep, beep: Mars space truck backs out of slippery sand trap
Curiosity finds new drilling target after course correction
'Leccy racer whacks petrols in Oz race
ELMOFO rakes in two wins in sanctioned race
What does a flashmob of 1,024 robots look like? Just like this
Sorry, Harvard, did you say kilobots or KILLER BOTS?
NASA's rock'n'roll shock: ROLLING STONE FOUND ON MARS
No sign of Ziggy Stardust and his band
Why your mum was WRONG about whiffy tattooed people
They're a future source of RENEWABLE ENERGY
prev story

Whitepapers

5 things you didn’t know about cloud backup
IT departments are embracing cloud backup, but there’s a lot you need to know before choosing a service provider. Learn all the critical things you need to know.
Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Build a business case: developing custom apps
Learn how to maximize the value of custom applications by accelerating and simplifying their development.
Rethinking backup and recovery in the modern data center
Combining intelligence, operational analytics, and automation to enable efficient, data-driven IT organizations using the HP ABR approach.
Next gen security for virtualised datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.