New Snowden leak: How NSA shared 850-billion-plus metadata records

'Federated search' spaffed info all over Five Eyes chums

Remote control for virtualized desktops

Documents leaked by Edward Snowden suggest during the noughties, the NSA massively expanded the information it shared with its Five Eyes allies and other agencies.

In slides given to The Intercept, the NSA boasts that its ICREACH program “increases NSA communications metadata sharing from 50 billion records to 850+ billion records”, with a footnote adding that 126 billion of these new records came from “2nd party SIGINT partners”.

At the same time, there was also a massive increase in the metadata the NSA was storing. The date, time, duration, called number and calling party – drawn from old PSTN systems – plus more than 20 new fields including latitude and longitude became part of the data-sharing.

To cope with all of this the NSA also needed to create a federated database search engine, in essence its own middleware sitting between the databases and the users.

Fast growth of NSA datasets

Massive increase in NSA's metadata volumes in 2006/2007. Image: The Intecept

The documents, dated 2006 and 2007, describe ICREACH as a “federated query” engine that would search “across all data sets for information relating to a target identifier.” The leaked documents say an agent could search from a single login to retrieve “phone number, global mobile satellite and cellular events and selectors, e-mail address, etc”.

Data brokers were also crafted to give the NSA's partners – the Australian Signals Directorate (then DSD), Britain's GCHQ, New Zealand's GCSB, Canada's CSE and the US FBI – access to the same search capabilities.

New fields added to what the NSA shared

The NSA shared more metadata fields post-ICREACH. Image: The Intercept

While maintaining the position that metadata is “information about content (but not the content itself)”, the NSA also notes that the data collected includes “formats and protocols used to render the information for people and systems”.

The middleware also meant that analysts were able to work on intelligence leads “without requiring access to raw intelligence”, the leaks claim. ®

Choosing a cloud hosting partner with confidence


Why cloud backup?
Combining the latest advancements in disk-based backup with secure, integrated, cloud technologies offer organizations fast and assured recovery of their critical enterprise data.
Getting started with customer-focused identity management
Learn why identity is a fundamental requirement to digital growth, and how without it there is no way to identify and engage customers in a meaningful way.
Reg Reader Research: SaaS based Email and Office Productivity Tools
Read this Reg reader report which provides advice and guidance for SMBs towards the use of SaaS based email and Office productivity tools.
Website security in corporate America
Find out how you rank among other IT managers testing your website's vulnerabilities.
Top 5 reasons to deploy VMware with Tegile
Data demand and the rise of virtualization is challenging IT teams to deliver storage performance, scalability and capacity that can keep up, while maximizing efficiency.