February 11, 2012
| Bio-IT World > Protein Folding, Anyone?


Protein Folding, Anyone?


By Salvatore Salamone

October 15, 2003 |  FLUSHING MEADOWS, N.Y. -- What do Andy Roddick and protein folding have in common? The U.S. Open tennis tournament.

While fans spent lulls in the action discussing Roddick’s rocketing serves, IBM life scientists were running algorithms on the same computers being used to track and report tournament results.

Taking advantage of a soon-to-be-announced IBM on-demand computational technology (code-named Project Symphony), company scientists snapped up unused processing cycles from the computers hosting the Open's Web site to test protein-folding algorithms that will eventually run on IBM's next-generation supercomputer, Blue Gene.

"We're preparing the simulations well before the Blue Gene hardware is available," says Ajay Royyuru, senior manager of IBM Research's computational biology center. "We use machines that we routinely have access to." Processing cycles are the limiting factor. "So we looked where else we could go to compute," Royyuru says.

“Where else” turned out to be the IBM servers hosting the Open's Web site, which offered a real-time scoreboard, player stats, and a means to purchase tickets and merchandise. Enough capacity was set aside to handle peak activity periods, which meant there would be many periods during the Open when server processing power would be sitting idle.

IBM researchers were able to use these computing cycles via a component of Project Symphony called the Tivoli Intelligent Orchestrator. The Orchestrator allows an IT manager to tap underutilized processing capacity of commodity servers to deliver supercomputer power to applications.

The Other Fast Server
While work on Blue Gene's hardware progresses (see Blue Gene is Cool for 2006, July 2003 Bio-IT World, page 1), researchers are developing protein-folding algorithms to run on the supercomputer, which is expected to be available in 2006.

The goal of the research is to answer the basic question: "What is it in a protein that causes it to misfold?" says Jed Pitera, a staff member at IBM's Almaden Research Center. Diseases associated with misfolding include Alzheimer's, “mad cow,” and cystic fibrosis.

By tapping the unused cycles of 127 CPUs of the U.S. Open's servers, Pitera and colleagues were able to run 150 simulations of small peptides folding in water. Each simulation had 9,000 to 10,000 atoms and showed how the peptides would fold in two to five nanoseconds.

Blue Gene will be able to perform these folding calculations on 10 times more atoms, and accelerate the simulation time by a factor of about 1,000. That translates into running a simulation for medium-sized proteins of 50,000 to 100,000 atoms in microseconds.

But to get to the point where the Blue Gene hardware can be efficiently used for calculations, the algorithms must be fine-tuned. And that's exactly what the Tivoli Intelligent Orchestrator is allowing the IBM researchers to do. "This lets us develop our algorithms and test our software before Blue Gene is available," says Bill Swope, also a staff member at the Almaden Research Center. "We can make effective use of Blue Gene as soon as it's available."

The IBM Tivoli Intelligent Orchestrator, scheduled to be introduced by now, will have a starting price of $20,000.



White Papers & Special Reports

sgi whp 2
Managing the Modern Genomics Data Flood
Sponsored by SGI

Managing and storing the perfect storm of multi-disciplined data pouring from next generation sequencers and other omics instruments is a central challenge in life sciences. Discover in this paper how the SGI ArcFiniti storage solution, optimized for unstructured genomics and life sciences data can: 

  • Reduce costs, proactively protect data integrity, and deliver the high performance I/O required for genomics data processing and analysis.  
  • Effectively manage capacities from 156TB to 1.4PB as a disk based, integrated hardware and software platform 


sgi - whp 1
Turning Genomics Data into Practical Insight
Sponsored by SGI

With worldwide sequencing capacity approaching 13 quadrillion DNA bases annually turning genomics data into knowledge is a true computational challenge. Read this paper and learn how the SGI UV coherent shared memory platform can:  

  • Speed results time while cost competitively tackling the most difficult computational problems across all omics disciplines. 
  • Push performance by scaling to extraordinary levels, up to 256 sockets (2,560 cores, 4,096 threads) per single system (one OS image). 

Provide support for up to 16TB of coherent shared memory in a single system image enabling extreme efficiency across a wide range of compute demands. 



accerlys-logo_2012_wh
New Complimentary Market Survey…
Collaborations and Communications Within Drug Discovery Research
Sponsored by Accelrys
This survey was conducted by the Cambridge Healthtech Media Group in January, 2012. It was sponsored by Accelrys related to their HEOS initiative to gather valid information around externalizing collaborative research while improving communications in the cloud. With 310 qualified industry respondents the survey findings reveal useful usage and trends patterns.  An insightful follow-on discussion and webinar related to this survey, and the HEOS by Scynexis SaaS portal is also available on the Bio-IT World website for complementary viewing.
 


Job Openings

tessella logo 
Scientific Software Engineer
Boston MA
$70,000 to $95,000
 
Apply at http://jobs.tessella.com   

oxford nanopore logo 


Early Access Collaborations ManagersClick here to find out more and apply   

Oxford Nanopore's GridION technology, VP, Sales and Marketing Click to  Apply  

For reprints and/or copyright permission, please contact  Tim McLucas, (781) 972-1342, tmclucas@healthtech.com .