February 11, 2012
| Bio-IT World > Man and Machine


Man and Machine


By Karen Hopkin

June 12, 2002 | While research groups spent the summer of 2000 scrambling to generate their predictions for CASP4, automated servers at two dozen sites around the world coolly submitted their best guesses for the same set of target proteins — less than 48 hours after the sequences were posted.

As genome-sequencing projects continue to churn out reams of data every day, researchers may come to depend on such automated processors to predict the structures of the thousands of proteins produced by even the simplest organisms. In CASP4, the automated servers did well on targets that were at least 40 percent similar to proteins of known structure, but their predictions fell short on proteins with unfamiliar sequences or novel folds. "We still need an expert to make judgments and push the right buttons," says Ajay Royyuru, head of the protein structure prediction group at IBM's Thomas J. Watson Research Center.

And maybe a human expert is all you need — if that expert is Alexey Murzin of the Medical Research Council's Laboratory of Molecular Biology in Cambridge, England. In a process he describes as part theory and part intuition, Murzin can eyeball an amino-acid sequence and see subtle patterns, telltale signs of a protein's evolutionary origin, that allow him to place the target into an already existing protein superfamily. Using other family members as templates, he then cobbles together a 3-D structure that looks good. In CASP4, Murzin's models placed among the top predictions for each target. "If we could codify the information in his head," says Howard Hughes Medical Institute investigator David Baker, "we'd be in great shape."

Until then, most CASP participants will continue to work toward automating their approach to structure prediction. If programs could do the bulk of the heavy lifting — examining a protein's sequence and choosing the most appropriate method for approximating its structure — without human intervention, researchers could turn their attention to more challenging problems, such as designing new proteins or learning how mutant proteins precipitate disease.

—Karen Hopkin

Back to Computational Biologists Join the Fold 




White Papers & Special Reports

sgi whp 2
Managing the Modern Genomics Data Flood
Sponsored by SGI

Managing and storing the perfect storm of multi-disciplined data pouring from next generation sequencers and other omics instruments is a central challenge in life sciences. Discover in this paper how the SGI ArcFiniti storage solution, optimized for unstructured genomics and life sciences data can: 

  • Reduce costs, proactively protect data integrity, and deliver the high performance I/O required for genomics data processing and analysis.  
  • Effectively manage capacities from 156TB to 1.4PB as a disk based, integrated hardware and software platform 


sgi - whp 1
Turning Genomics Data into Practical Insight
Sponsored by SGI

With worldwide sequencing capacity approaching 13 quadrillion DNA bases annually turning genomics data into knowledge is a true computational challenge. Read this paper and learn how the SGI UV coherent shared memory platform can:  

  • Speed results time while cost competitively tackling the most difficult computational problems across all omics disciplines. 
  • Push performance by scaling to extraordinary levels, up to 256 sockets (2,560 cores, 4,096 threads) per single system (one OS image). 

Provide support for up to 16TB of coherent shared memory in a single system image enabling extreme efficiency across a wide range of compute demands. 



accerlys-logo_2012_wh
New Complimentary Market Survey…
Collaborations and Communications Within Drug Discovery Research
Sponsored by Accelrys
This survey was conducted by the Cambridge Healthtech Media Group in January, 2012. It was sponsored by Accelrys related to their HEOS initiative to gather valid information around externalizing collaborative research while improving communications in the cloud. With 310 qualified industry respondents the survey findings reveal useful usage and trends patterns.  An insightful follow-on discussion and webinar related to this survey, and the HEOS by Scynexis SaaS portal is also available on the Bio-IT World website for complementary viewing.
 


Job Openings

tessella logo 
Scientific Software Engineer
Boston MA
$70,000 to $95,000
 
Apply at http://jobs.tessella.com   

oxford nanopore logo 


Early Access Collaborations ManagersClick here to find out more and apply   

Oxford Nanopore's GridION technology, VP, Sales and Marketing Click to  Apply  





For reprints and/or copyright permission, please contact  Tim McLucas, (781) 972-1342, tmclucas@healthtech.com .