Algorithm ramps up genetic computation

'Sailfish' boosts RNA gene expression predictions

The world has been sequencing genomes for a long time, but applying what we know about genetics to everyday medicine is a tough ask.

For example, readers might remember that the business of crafting treatments from genes is so complex that IBM recently entered a partnership to get its Watson megabrain learning to help medicos craft personalised treatments for cancer.

Part of the problem that researchers want to solve is “gene expression”: in all the complexities of how genes interact, which interactions are “expressed” in a physical trait, whether that trait is blue eyes or the reason one individual dies of a cancer that's arrested in someone else?

What's wanted is a way to predict gene expression, and one angle of the research is based on RNA sequencing (RNA-seq) data. The problem is that analysing RNA-seq data is a slow business, and that's where the research out of Carnegie Mellon University and the University of Maryland comes in. Their Sailfish algorithm dramatically accelerates the estimation of gene expression from RNA-seq data.

To explain why this is important, the researchers' release says: “Though an organism's genetic makeup is static, the activity of individual genes varies greatly over time, making gene expression an important factor in understanding how organisms work and what occurs during disease processes. Gene activity can't be measured directly, but can be inferred by monitoring RNA, the molecules that carry information from the genes for producing proteins and other cellular activities.”

However, analysing RNA-seq “reads” – short sequences of RNA – traditionally produces huge datasets in which every read has to be mapped back to where it came from. The Sailfish “secret sauce” (not so secret, since the code has been publicly released) is that it skips this painstaking mapping step.

Instead, the researchers “found they could allocate parts of the reads to different types of RNA molecules, much as if each read acted as several votes for one molecule or another”. Think of it as upvoting posts in a forum: individual votes bestow a kind of consensus on which reads – or posts – carry the greatest significance.
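For the curious, the voting idea can be sketched in a few lines of Python. This is a toy illustration only, not the Sailfish code: it assumes reads are chopped into short fixed-length chunks (k-mers), that each chunk's votes are split among the molecules containing it in proportion to the current abundance estimates, and that the split is repeated until it settles. The function names, the tiny sequences and the choice of `k=4` are all made up for the example, and it ignores refinements such as correcting for molecule length.

```python
from collections import Counter, defaultdict


def kmers(seq, k):
    """Yield every overlapping k-length substring of seq."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def estimate_abundance(transcripts, reads, k=4, rounds=50):
    """Toy estimate of relative abundance by fractional k-mer voting.

    transcripts: dict of name -> sequence (hypothetical reference molecules)
    reads: list of short read sequences
    Returns a dict of name -> share of the total votes (shares sum to 1).
    """
    # Index which transcripts contain each k-mer; no read is ever aligned.
    index = defaultdict(set)
    for name, seq in transcripts.items():
        for km in kmers(seq, k):
            index[km].add(name)

    # Count the k-mers seen in the reads, ignoring ones no transcript contains.
    counts = Counter(km for r in reads for km in kmers(r, k) if km in index)

    # Start from equal shares, then repeatedly re-split each k-mer's votes
    # among its candidate transcripts in proportion to their current shares.
    shares = {name: 1.0 / len(transcripts) for name in transcripts}
    for _ in range(rounds):
        votes = dict.fromkeys(transcripts, 0.0)
        for km, n in counts.items():
            total = sum(shares[t] for t in index[km])
            for t in index[km]:
                votes[t] += n * shares[t] / total
        grand = sum(votes.values())
        if grand == 0:  # no usable k-mers in the reads: keep the equal split
            return shares
        shares = {t: v / grand for t, v in votes.items()}
    return shares


if __name__ == "__main__":
    refs = {"A": "AAAACCCC", "B": "AAAAGGGG"}
    reads = ["AAAACCCC"] * 3 + ["AAAAGGGG"]
    print(estimate_abundance(refs, reads))
```

In this made-up example the chunk `AAAA` occurs in both molecules, so its votes are shared out rather than assigned to one of them; with three reads from A and one from B, the estimate settles at roughly three quarters for A and one quarter for B.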

Getting what might be a 15-hour analysis down to minutes is important, the researchers believe: there are already huge repositories of RNA-seq data, but turning data into insight is held back by computational effort.

Fifteen hours for each analysis “really starts to add up, particularly if you want to look at 100 experiments”, explains Carnegie Mellon associate professor Carl Kingsford. “With Sailfish, we can give researchers everything they got from previous methods, but faster.” ®
