The Register® — Biting the hand that feeds IT

Comments on: DNA sequencing for the masses

Scary 

Posted Friday 7th March 2008 12:20 GMT

The scary truth is that this reflects the history of fingerprinting, from a crime detection point of view. The methods used in fingerprint identification today don't stand up to comparisons across millions of subjects but they still hold a great deal of public confidence. Planting a trace of a third party's dna at a crime scene may not be simple but it is by no means impossible.

This is just the first worm in a very large can.

Personal service 

Posted Friday 7th March 2008 13:06 GMT

Alien

I just took part in an interview to complete a report for Sky News about a company called 23andMe.com - they are backed by a few VC's and Google are in on it too.

They come at this from a slightly different angle (mainly interested in geographical and migratory genetic history) but produce the same detailed genetic info which can be used for any purpose.

The pack (you spit in a tube and send it off!) costs $999 - remarkable!

It'll be less than $500 in 6 months i bet. of course, i got mine free for volunteering (natch)

Will be interested to find out where my ancestors stomped about!

(alien as i am pretty convinced there is something funny in my family tree!)

Not scared 

Posted Friday 7th March 2008 13:08 GMT

Alien

I just took part in an interview to complete a report for Sky News about a company called 23andMe.com - they are backed by a few VC's and Google are in on it too.

They come at this from a slightly different angle (mainly interested in geographical and migratory genetic history) but produce the same detailed genetic info which can be used for any purpose.

The pack (you spit in a tube and send it off!) costs $999 - remarkable!

It'll be less than $500 in 6 months i bet. of course, i got mine free for volunteering (natch)

Will be interested to find out where my ancestors stomped about!

(alien as i am pretty convinced there is something funny in my family tree!)

Moore's law? 

Posted Friday 7th March 2008 13:15 GMT

Can somebody please explain how Moore's law is related to DNA sequencing? I'm not a geneticist here, but last I checked, DNA did not have transistors. And since Moore's law states that the number of transistors used to create a processor roughly doubles every 18 months, I fail to see how that is even remotely related to DNA sequencing. Even the alternately-accepted meaning (processor speed roughly doubles every 18 months) isn't even remotely related to DNA sequencing.

Re: Scary 

Posted Friday 7th March 2008 13:18 GMT

Black Helicopters

It's certainly a lot easier to plant DNA than fingerprints.

Why Moore's law is relevant... 

Posted Friday 7th March 2008 13:47 GMT

Black Helicopters

For two reasons. To understand why you have to understand how whole genome shotgun sequencing works (the Venter approach). You basically take a genome and blast it into loads of little pieces of random length. The smaller the pieces, the easier they are to sequence. However, you've got to work out where all the sequences go in the genome. If you've only got one copy of the genome, it's impossible, so you blast, say, five copies of the genome, sequence them, and because each copy is split in different places, you end up with areas of overlap which you can use to figure out which piece goes where.

Moore's law is relevant, because although a lot of little pieces are (relatively) easy to sequence, it required a sodding enormous computer to calculate where all the overlaps go. Thus, Moore's law is directly relevant, because the more powerful the computer, the quicker the sequencing can be done, and the fewer original genome copies are required in the first place (because the fewer copies are available, the more computing power is required).

The other reason relates to the sequencing machines themselves. I'm a bit more hazy on the technology here, but basically, something like Moore's law is relevant, because the more dna samples you can fit onto a sample carrier (one technique involves placing them in a square pattern), the more sequences you can do in one go.

@Chris C 

Posted Friday 7th March 2008 13:57 GMT

Moors law is used in reference to the computer used to analyse the chemical data and produce a dna sequence which is what takes most of the time

Scary 2 

Posted Friday 7th March 2008 13:57 GMT

"It's certainly a lot easier to plant DNA than fingerprints." ...By Kevin Whitefoot Posted Friday 7th March 2008 13:18 GMT

Kevin,

And a lot easier to disprove if planted in the wrong place. In the right place, it will grow and in the wrong place it will prove itself false.

And who would want to plant DNA anyway? For what Honest to Goodness Reason?

@amanfromMars 

Posted Friday 7th March 2008 14:29 GMT

not for a Honest to Goodnest reason, but for a dishonest reason.

Why Moore's law is really relevant 

Posted Friday 7th March 2008 14:32 GMT

Boffin

Because, the matrix style cascade of ATCG characters, the flashy 3d double helix graphics that, the [usually attractive] CSI worker needs displaying as part of the sequencing, and the fast-imaging switching of various ethnic faces just before a "MATCH FOUND" and the last face, that of an unshaved bit part character actor, is displayed and the sinister music starts playing.

This all requires substantial computing power.

Moreso for sequencing, We need the test to flash 3d graphics that show what the person with that DNA looks like, how big and what shape her breasts are, how intelligent she is, what embarrassing conditions she is susceptible to, what she had for lunch and so on, who her favourite rock band is. What do you mean it can't tell you that? Are you a cynic?

re: Moore's law? 

Posted Friday 7th March 2008 14:40 GMT

Boffin

Genome sequencing is very heavily dependent on computing power because after the actual sequencing the data needs to be assembled.

It's like shaking all the words from Shakespeare's complete works into a big pile on the floor and trying to put them back the correct order, but with many, many more words.

Previously, a sequencing run would output tens of megabases (millions of letters) per run. We now have upto a gigabase (a billion letters) *per run* of information that needs assembling and considering a run takes about a day, over a working week that's a lot of data. We've been rather spoilt with processing power lately, but this new technology really pushes our hardware!

@Scary 2 

Posted Friday 7th March 2008 14:45 GMT

Black Helicopters

it's not the Honest reasons we're all worked up about..

amanfromMars 

Posted Friday 7th March 2008 15:10 GMT

Alien

Ive been doing some snooping and...

www.ur2die4.com

seems our favourite funny man has his own website now....

AC just in case the little green men come to get me....

DNA sequencing and Moore's Law 

Posted Friday 7th March 2008 19:59 GMT

It's an analogy, and if anything, it understates the rapid advances in sequencing technology.

In the early years of the International Human Genome Project, DNA sequencing was done using gel electrophoresis, a technique that was heavily labour-intensive and rather prone to errors.

Then in 1998, a new technique called capillary sequencing became available through a machine called the Applied Biosystems Inc. 3700, which allowed greater quantities of DNA to be sequenced at lower cost, with far less human involvement, and with a lower error rate. This ushered in "high throughput" sequencing and allowed the human genome to be sequenced far faster than had originally been thought possible when the Human Genome Project was started.

Sequencing technology has continued to develop extremely rapidly. In the past three or four years, several companies, including 454 Life Sciences, Illumina and ABI, have developed machines that can sequence amounts of DNA equivalent to several entire human genomes in a day. It's reached the point where the real problem is how to store the vast amounts of raw data which these machines produce. The Grauniad ran a good article last week:

http://www.guardian.co.uk/technology/2008/feb/28/research.computing

I'm an astronomer by training, but for the past ten years I've worked at one of the world's leading DNA sequencing centres. If telescopes had developed as fast as DNA sequencing technology, Galileo would have been using 10-metre telescopes with adaptive optics back in the early 17th century.

More computer in DNA 

Posted Friday 7th March 2008 20:40 GMT

Linux

Many already explained, so just a little storage detail here... I'd like to point out that a few hours of activity from our smallest new generation sequencer I work with here (a 454 FLX) can generate about 50 GB of raw data (images, mostly), depending on the size of the job. The other new-gen sequencer (Solexa) generates 1 TB in a day, I've heard. So even more than CPU, our problem now is storage and back up. Where did I put that USB key again?

Oh, and it all runs on Linux. (at least the 454; most of the older ones used to run on Macs)

I'm not sure, but ... 

Posted Saturday 8th March 2008 22:11 GMT

... it looks a little like amanfromMars has just passed a very informal Turing Test. Congratulations, amanfromMars!

Don’t Miss