July 20, 2008
| Bio-IT World > What Big Clusters
What Big Clusters


By BIO-IT World

What Big Clusters

Computers have come a long way in the past 20 years. "My thesis took three years of computational effort," says chemist David Osguthorpe of the University of Bath in England. Osguthorpe worked on an algorithm for predicting the formation of alpha helices and beta sheets. "I could do it in less than a week now," he says.

Not only are more powerful machines now available, they're also relatively inexpensive. By springing for a handful of basic, bottom-of-the-line PCs, anyone can piece together a system with enough power to tackle simple structure prediction without breaking the bank. "We type 'cheap cheap cheap' into a search engine and see what comes up," says Stanford University's Michael Levitt, who harvests his structure predictions from a string of 150 PCs running on Linux.

Although most CASPers have access to large clusters of dual processors — Pentium IIIs are pretty common — this year the Pittsburgh Supercomputing Center (PSC) is offering CASP5 participants time on its terascale computer system to help level the computational playing field. As of April, the PSC had allocated about 100,000 CPU hours on its machine, a cluster of 3,000 Compaq alpha processors capable of performing six teraflops (trillion floating-point operations per second). Another million hours are available, says the PSC's David Deerfield, who was expecting a rush of applications once the targets were in hand.

With some tweaking, CASPers should be able to compile their Fortran or C code programs on the PSC platform. The trick, notes the PSC's Troy Wymore, will be in writing programs that take full advantage of the parallel performance of the machines. By executing operations in parallel, a job that would take nearly six weeks on a single CPU can be completed in six hours when distributed over, say, 164 processors.

At the same time, computer scientists at IBM's Thomas J. Watson Research Center outside New York City continue to build Blue Gene, a petaflop machine that could add muscle to folding programs by allowing researchers to track every atom in a protein. Made from a million CPUs connected in ever larger bunches, Blue Gene will be able to perform a quadrillion (1015) calculations per second (petaflop). That's about a thousand times faster than Deep Blue, the computer that beat world chess champion Garry Kasparov in 1997.

Although the added speed will help investigators monitor the folding process, it might not be fast enough for CASP. Using an all-atom simulation, Blue Gene would take a year to fully fold a protein of 300 amino acids. At that rate, says Ram Samudrala of the University of Washington, "you might as well solve the structure in lab using X-ray crystallography or magnetic resonance techniques."

And in the end, the problems researchers face in predicting protein structure might be better attacked on the blackboard than the keyboard. "It's a hard problem," notes the University of Iowa's Alberto Segre, "because even if we had all the computational power in the world, we still don't fully understand the forces that make proteins adopt their shape."

So for now, Levitt says, an old-fashioned approach remains viable. "If I thought that with 10 times more computer power I could solve the folding problem, I'd go out and buy it," he says. "But maybe we need a million times more. So instead I sit down and think."

—Karen Hopkin

Back to Computational Biologists Join the Fold



Click here to login and leave a comment.  

0 Comments

Add Comment

Text Only 2000 character limit

Page 1 of 1

White Papers & Special Reports

definiens briefingon-76Next-Generation Technologies Revolutionizing Oncology and Diagnostics
underwritten by Definiens

This “Briefing On” collection of Bio-IT World features, commentaries and analysis, presents some of the latest thinking on high-throughput technologies that are being applied to the fields of research and drug discovery, with particular emphasis on oncology, diagnostics and imaging technologies. Download now at no charge compliments of the underwriting sponsor, Definiens. Download This Free Paper



gq nxt gen seq

This Bio•IT World Briefing On “Next-Generation Sequencing,” underwritten by GenomeQuest, Inc.,
presents a selection of feature stories, interviews,commentaries, conference reports, and editorials on the emergence, opportunities, and challenges posed by high-throughput sequencing. Covered in this collection: the launch of new platforms from Applied Biosystems and Helicos; new applications of nextgen sequencing; the rise of personal genomics; and informatics solutions to vexing problem of managing the vast volumes of next-gen data.  Download now 



Life Science Webcasts & Podcasts

GenoLogicsgenologics 2 translational
Enabling Translational Research Informatics

Learn about the challenges facing life sciences research labs to manage their translational research data:

  • The trends for organizations to adopt informatics solutions for translational research.
  • The unique requirements with managing complex data and workflow.
  • What labs should consider when reviewing informatics solutions for translational research.
  • Which life sciences research organizations are successfully adopting an informatics solution.

Download Now



More Podcasts

Job Openings

Assistant Editor (Science Writer)~Cambridge Healthtech Institute (CHI), Needham, MA, 
Cambridge Healthtech Institute seeks an assistant editor (science writer) who is an ambitious, dependable journalist who can fulfill a range of writing and editorial duties for a series of eNewsletters covering various aspects of the biopharmaceutical industry in addition to CHI’s flagship publication, Bio-IT World magazine.  This is a superb opportunity to make important contributions to the growth and success of a multimedia science publishing group, while gaining invaluable experience in multiple facets of the publishing industry.   Interested candidates should submit a cover letter, including 3 writing samples (attached in Word or PDF format), salary history or requirements, and resume to kdavies@healthtech.com. 

Fred Hutchinson Cancer Research Center: IT Business Analyst III
The Hutchinson Center is the only National Cancer Institute-designated comprehensive cancer center in the Pacific Northwest. Through our Tumor Research Initiative, we are finding new ways to detect tumors at an early stage.  We are presently seeking an experienced IT Business Analyst to assess technology needs for the Tumor Research Initiative, and to identify and design improvements to computer based systems.  For more information please visit www.fhcrc.org and search for Job# AD-21465





For reprints and/or copyright permission, please contact RMS, 1808 Colonial Village Lane, Lancaster, PA;

(717) 399-1900 ext 100 or via email to bio-itworld@theygsgroup.com.