The Register® — Biting the hand that feeds IT

Public genome databases can leak identity

Anonymity only goes so far

Public genome data poses a significant privacy risk to individuals, according to research led by Yaniv Erlich, a geneticist at the Whitehead Institute for Biomedical Research.

Erlich’s team was able to de-anonymise genome data using only public information and careful internet searches. A little chillingly, individuals could be linked to patrilineal genetic characteristics even if they weren’t in the databases themselves: a family member’s presence can be enough, if the two are related in the male line and carry the same surname.

Working with data published in two public genomic databases, Ysearch and SMGF, Erlich demonstrated the privacy risk by matching Y-chromosome data to 50 individuals. The work appears in a paper published in Science (abstract here, full paper available free with registration).

Among the genome data recorded in the databases are genetic markers called “short tandem repeats” (for which genetic science hasn’t yet identified a specific purpose). Those on the Y chromosome are passed down the male line.

As the paper notes, it had been assumed that listing surnames in the databases didn’t place individual identity at risk, since surnames “could match thousands of individuals”. However, the genome data has become a genealogy tool as well, in databases such as YBase.

DNA sequencing pioneer Dr Craig Venter volunteered as a test subject in the research. With only the relevant DNA sequence, Dr Venter’s age, and the US state where he lives, Erlich was able to retrieve just two possible records – one of which was Dr Venter.

With a known surname, the searches become even more accurate: “Combining the recovered surname with additional demographic data can narrow down the identity of the sample originator to just a few individuals,” Erlich states in the paper.

“Surname inference from personal genomes puts the privacy of current de-identified public data sets at risk”, it continues.
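As a rough illustration of the two steps described above (recover a likely surname from Y-chromosome markers, then narrow the field with age and US state), here is a minimal sketch in Python. It is not the method used in the paper: the matching rule is a simple mismatch count, the function names are hypothetical, and every record and repeat count is invented toy data. Only the marker names (DYS19, DYS390, DYS391, DYS393) are real Y-STR loci.

```python
from collections import Counter

# Toy genealogy database: (surname, Y-STR profile). All values are invented.
GENEALOGY_DB = [
    ("Smith", {"DYS19": 14, "DYS390": 24, "DYS391": 11, "DYS393": 13}),
    ("Smith", {"DYS19": 14, "DYS390": 24, "DYS391": 10, "DYS393": 13}),
    ("Jones", {"DYS19": 15, "DYS390": 22, "DYS391": 10, "DYS393": 12}),
    ("Baker", {"DYS19": 16, "DYS390": 25, "DYS391": 11, "DYS393": 14}),
]

# Toy public demographic records. All values are invented.
PEOPLE = [
    {"name": "A. Smith", "surname": "Smith", "birth_year": 1946, "state": "CA"},
    {"name": "B. Smith", "surname": "Smith", "birth_year": 1946, "state": "NY"},
    {"name": "C. Smith", "surname": "Smith", "birth_year": 1980, "state": "CA"},
    {"name": "D. Jones", "surname": "Jones", "birth_year": 1946, "state": "CA"},
]


def mismatches(a, b):
    """Count markers present in both profiles whose repeat counts differ."""
    shared = a.keys() & b.keys()
    if not shared:
        return float("inf")  # nothing to compare, so never a match
    return sum(a[m] != b[m] for m in shared)


def infer_surnames(profile, db, max_mismatches=1):
    """Return surnames whose reference profiles are close to the sample,
    most frequently matched first. The threshold is an assumption."""
    hits = Counter(
        surname for surname, ref in db
        if mismatches(profile, ref) <= max_mismatches
    )
    return [surname for surname, _ in hits.most_common()]


def narrow(surnames, people, birth_year, state):
    """Keep only people with a candidate surname, birth year and state."""
    return [
        p["name"] for p in people
        if p["surname"] in surnames
        and p["birth_year"] == birth_year
        and p["state"] == state
    ]


# An "anonymous" sample: markers only, no name attached.
sample = {"DYS19": 14, "DYS390": 24, "DYS391": 11, "DYS393": 13}
candidates = infer_surnames(sample, GENEALOGY_DB)
matches = narrow(candidates, PEOPLE, birth_year=1946, state="CA")
print(candidates, matches)
```

The point of the sketch is that neither lookup needs the sample donor to be in the genealogy database: a male-line relative with the same surname is enough to produce the surname candidate, and ordinary public records do the rest.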

“In five surname recovery cases, we fully identified the CEU* individuals and their entire families with very high probabilities … data release, even of a few markers, from one person can spread through deep genealogical ties and lead to the identification of another person who might have no acquaintance with the person who released his genetic data”. ®

*CEU refers to a particular genetic dataset: “multigenerational families of northern and western European ancestry in Utah who had originally had their samples collected by CEPH (Centre d’Etude du Polymorphisme Humain)”. ®

Read more closely

Anonymised DNA data can be used to identify individuals.

Anonymous Coward

There you have it

I bet the Government and all 'interested' parties are rubbing their hands with glee.

I downloaded just Dr Craig Venter's genome data. After a bit of analysis I predict he will be a baldy bloke. I now just need his email address so I can spam him with my toupee services.

