Making DREAM(s) Come True Isn’t Easy


By John Russell

July 14, 2008 | How good is your algorithm for turning experimental data—mainly gene expression or protein interactions—into accurate pictures of biological networks?  If the results from the first DREAM challenge are any indication, plenty of progress is needed to improve pathway predictions from data.

The DREAM initiative—Dialogue for Reverse Engineering Assessments and Methods—is intended to help the life science community improve computational techniques for inferring networks. Datasets from known networks are provided and researchers are invited to uncover the true underlying networks using computational techniques. In this first “competition,” there were five challenges, 36 participating teams, and 110 predictions.

“DREAM is trying to understand whether we predict something meaningful when we take high-throughput data like gene expression and we say, well this is the network of interactions,” says one of DREAM’s organizers, Gustavo Stolovitzky. “Usually you validate with five or six connections that you either do in the lab or validate through literature. But actually you have made 1,000 predictions and you only cherry pick the five that match your data, so in a way, we don’t know whether we are fooling ourselves or whether really we have something in those predictions.”

On balance, this year’s predictions were lousy. Talking about one specific challenge, Stolovitzky says, “These were 200 genes of which 53 were true positives and the rest were not. Some people did well in the first 10 or 15, but then really not so well. At the same time many, many teams, some of which are very well known people that have one of these algorithms as their favorite algorithm [did very poorly] and their favorite algorithm is very bad, really bad, as bad as random so if you put a random predictor you will have predicted better.”

Predictions of protein interactions were even worse. Stolovitzky says simply, “If you, today, go with a set of proteins and sequences and you give it to someone who says, ‘I can predict which of these proteins are interacting,’ probably he is going to give you garbage. And that is a truth, but it is important to know that this is the case. It’s not to just roast the people who didn’t do well. It’s just to understand where we should improve.”

Ouch. Not surprisingly no commercial tool providers tackled the challenges. And let’s be clear—as Stolovitzky is—the idea isn’t to roast anyone. The goal is to learn which algorithms work best, to assist in developing better algorithms, and to help the entire community move forward. In releasing DREAM2 results, the names of the teams were not revealed.

Stolovitzky is adjunct associate professor of Biomed Informatics, Columbia University, and manager, Functional Genomics & Systems Biology, IBM Research. He notes that despite the poor results of the first set of challenges, much good work is being done by researchers using similar techniques.

Asked for his thoughts on recent work led by Merck researcher Eric Schadt in identifying key networks involved in metabolic disease by interpreting gene expression and other data (see “Merck’s Informatics Mission,” Bio-IT World, May 2008), he says, “They are great. I think he’s doing something that we all should be doing, and we are not doing in DREAM, which is he puts together a lot of [different kinds of] information like quantitative analysis, gene expression, and clinical information.”

Round Three
Plans are already afoot for DREAM3 which will be held in conjunction with 5th Annual RECOMB Satellite on Regulatory Genomics and the 4th Annual RECOMB Satellite on System Biology next October 29-Nov 2 (http://compbio.mit.edu/recombsat/). The meeting is jointly organized by the Broad Institute of MIT and Harvard, and the MIT Computer Science and Artificial Intelligence Lab (CSAIL). “We would like to cordially invite anyone to submit network inferences and answers to our new biological prediction challenges. To access the data sets and descriptions of the challenges, please go to http://wiki.c2b2.columbia.edu/dream/index.php/The_DREAM3_Challenges,” say Stolovitzky and DREAM3 co-organizer, Andrew Califano.

Interest in DREAM is growing, reports Stolovitzky. The International Society for Computational Biology is creating a team of students to participate in DREAM he says. He also says they will probably ask researchers if they want to be identified next year. Given the suggestion that some well-regarded researchers fared less well, this might by an opportunity for others to gain notice.

In any case, for the foreseeable future, consistent prediction of underlying networks from just a few data types and using only computational techniques is still something of a dream.

___________________________________________________

This article appeared in Bio-IT World Magazine.

Subscriptions are free for qualifying individuals.  Apply Today.

 

 

Click here to login and leave a comment.  

0 Comments

Add Comment

Text Only 2000 character limit

Page 1 of 1

White Papers & Special Reports

Waters white paper image
Software Helps Doping Control Lab Streamline Results Management
Sponsored by Waters
The Karolinska University Hospital’s Doping Control Lab tests thousands of samples annually for stimulants, diuretics, and other masking agents. Increased regulatory pressure and new technologies increased the number of samples analyzed creating data management challenges. Waters® NuGenesis® Scientific Data Management System and TargetLynx™ Application Manager software were used to reduce the time required to calculate, review and search results.


sas whitepaper92
Managed Innovation, Assured Compliance
Sponsored by SAS
Discovery organizations are identifying a lot of promising compounds, but clinical research processes haven't kept pace with timely testing of all those potential therapies. This white paper describes how SAS® Drug Development supports true innovation across the clinical trial process.

In this white paper you will learn how to:

  • Assemble data to foster better collaboration
  • Get up-to-date information during clinical trials
  • Make informed decisions earlier in the trial process


BlueArc white paper image
Addressing Life Sciences Constantly Growing Data Challenges Research Environments
Sponsored by BlueArc
The continued explosion of raw experimental data, the increased use of video, the growing adoption of new data retention practices, and the move to high throughput computational workflows are all placing new demands on the way life sciences organizations store and manage their data.

Download this white paper to learn about:

  • Factors driving the data explosion in the life sciences
  • New data management issues that must be addressed
  • HPC trends that are placing new demands on storage
  • Storage solution attributes that address performance, manageability, and energy efficiency.


Life Science Webcasts & Podcasts

Medidata Solutions

Rising Clinical Trial Delays and Costs - Addressing the Cause, Not the Symptoms 

medidata podcastProtocol complexity is taking a toll on clinical study speed and efficiency: increasingly complicated and ambitious protocols are not only burdening sites and study volunteers but are also prolonging trials and increasing expenses. In response, sponsors have turned to global study placement, restructured site relationships and new site management practices, but the problem remains.

This podcast will discuss:

  • Why these responses address only the symptoms, not the underlying cause, of rising clinical trial delays and costs.
  • Results of a recent joint Tufts University / Medidata Solutions study.
  • New metrics benchmarking protocol design trends.
  • Systematic protocol design improvements and why they are essential to clinical trial performance excellence.

Speakers: Ken Getz, Senior Research Fellow at the Tufts Center for the Study of Drug Development, and Ed Seguine, General Manager, Trial Planning Solutions at Medidata.

Download Now 



More Podcasts

Job Openings

Director, Center For Information Technology (CIT) - National Institutes of Health  (NIH), Department of Health and Human Service
Located in Bethesda, MD. This position requires:
• High-level vision, leadership, management, and modernization of CIT programs and services.
• Strategic direction and policy development for CIT long-term operations and objectives.
• Serve as a key IT advisor to the NIH Chief Information Officer.
A TOP SECRET security clearance will be required.  More job detail is found at:  http://www.jobs.nih.gov under the Executive Jobs section.Or contact Ms.Winnie Garner at seniorre@od.nih.gov.  Applications must be received ELECTRONICALLY by (11:59 p.m.), December 17, 2008.  DHHS and NIH are Equal Opportunity Employers

Bioinformatics Manager- Lilly Singapore Centre for Drug Discovery
For more information click here 

For reprints and/or copyright permission, please contact The YGS Group, 1808 Colonial Village Lane, Lancaster, PA;

(717) 399-1900 ext. 125, or via email to Ashley.Zander@theYGSgroup.com.