TurboWorx Ties On-Demand and In-House Workflows


By Salvatore Salamone Bio-IT World

TurboWorx, in a partnership with IBM, has extended its workflow and distributed computing software to include IBM’s On Demand computational services.

As a result, computation jobs, which up to now TurboWorx could run on a variety of in-house compute nodes and clusters, can also run on systems in IBM’s managed, high-performance On Demand computing centers.

The practical consequence of the partnership is that life science organizations will now be able to integrate the running of in-house and on-demand compute jobs – something that is not commonly done today.

“With this partnership, [life science organizations] can set up workflows that extend their in-house capacity to take advantage of IBM’s On Demand computing services,” says Jason Alter, vice president, marketing and services at TurboWorx.

This very different from the norm. On-demand computing services have been offered for several years to handle peaks in computing needs. But up to now, these services have typically been used for specific, dedicated projects and computing jobs.

With the partnership, a life science organization using TurboWorx’s software could design a computing workflow that automatically accesses IBM’s On Demand services within the execution of a job. For example, if a cluster is running one application and a second job needs more processing power than is available, the TurboWorx software could be set up to automatically run the second job using the On Demand service.

Similarly, the incorporation of on-demand computing power could be used in computing workflows where certain steps in a workflow require more processing power than might be available at the time the step is executed.

Thought Experiment
When discussing the partnership, a TurboWorx customer came up with a hypothetical application where this linkage between in-house and on-demand might help speed its research efforts.

The customer (a research organization that did not want to be named) needs lots of computing to help with its microscope image processing. One idea it had is to tap the processing power of Blue Gene as offered through an IBM On Demand service.

The application it has in mind involves the deconvolution of its microscope images. Deconvolution is a technique commonly used in image analysis. The need for this technique lies in the fact that image data is a combination of the true image plus factors such as distortions or variations due to the instruments and techniques used to capture the image. If one can determine the impact of these extra factors and compensate for them, the true image can be recovered.

An analogy can be made with a 35-mm camera that accidentally over-exposes an entire roll of film. A person developing the film could correct each photo one-by-one. But if that person knows that all of the photos have been over-exposed by the same amount, he or she could determine a corrective action and apply the same correction to all the photos on the roll.

The deconvolution process with a research organization’s microscope images has several steps. First, stored compressed images must be decompressed. Then all images are run through deconvolution algorithms to search for and determine any factors that contribute to image distortion.. Once this is done, any image called up to be viewed or analyzed is corrected using the information derived from the deconvolution analysis of the entire set of images.

In such a workflow, things like the compression, decompression, display, and analysis of individual images could be handled with in-house computing facilities. But the deconvolution of the entire set of images is something that could use on-demand processing power.

Currently, deconvolution is run on large, shared-memory systems. While the algorithms are highly parallel, they have not typically been run on distributed-memory systems because of the huge amount of overhead required to move the large volumes of data around. But Blue Gene lends itself to this application because of its very large processing power. That is why this hypothetical workflow would be a good application of incorporating in-house and on-demand computing power.

There are many other life science applications that have this asymmetry in their computing workflows that would also lend themselves to a tight integration between in-house and on-demand services.

Click here to login and leave a comment.  

0 Comments

Add Comment

Text Only 2000 character limit

Page 1 of 1

White Papers & Special Reports

Waters white paper image
Software Helps Doping Control Lab Streamline Results Management
Sponsored by Waters
The Karolinska University Hospital’s Doping Control Lab tests thousands of samples annually for stimulants, diuretics, and other masking agents. Increased regulatory pressure and new technologies increased the number of samples analyzed creating data management challenges. Waters® NuGenesis® Scientific Data Management System and TargetLynx™ Application Manager software were used to reduce the time required to calculate, review and search results.


sas whitepaper92
Managed Innovation, Assured Compliance
Sponsored by SAS
Discovery organizations are identifying a lot of promising compounds, but clinical research processes haven't kept pace with timely testing of all those potential therapies. This white paper describes how SAS® Drug Development supports true innovation across the clinical trial process.

In this white paper you will learn how to:

  • Assemble data to foster better collaboration
  • Get up-to-date information during clinical trials
  • Make informed decisions earlier in the trial process


BlueArc white paper image
Addressing Life Sciences Constantly Growing Data Challenges Research Environments
Sponsored by BlueArc
The continued explosion of raw experimental data, the increased use of video, the growing adoption of new data retention practices, and the move to high throughput computational workflows are all placing new demands on the way life sciences organizations store and manage their data.

Download this white paper to learn about:

  • Factors driving the data explosion in the life sciences
  • New data management issues that must be addressed
  • HPC trends that are placing new demands on storage
  • Storage solution attributes that address performance, manageability, and energy efficiency.


Life Science Webcasts & Podcasts

Medidata Solutions

Rising Clinical Trial Delays and Costs - Addressing the Cause, Not the Symptoms 

medidata podcastProtocol complexity is taking a toll on clinical study speed and efficiency: increasingly complicated and ambitious protocols are not only burdening sites and study volunteers but are also prolonging trials and increasing expenses. In response, sponsors have turned to global study placement, restructured site relationships and new site management practices, but the problem remains.

This podcast will discuss:

  • Why these responses address only the symptoms, not the underlying cause, of rising clinical trial delays and costs.
  • Results of a recent joint Tufts University / Medidata Solutions study.
  • New metrics benchmarking protocol design trends.
  • Systematic protocol design improvements and why they are essential to clinical trial performance excellence.

Speakers: Ken Getz, Senior Research Fellow at the Tufts Center for the Study of Drug Development, and Ed Seguine, General Manager, Trial Planning Solutions at Medidata.

Download Now 



More Podcasts

Job Openings

Director, Center For Information Technology (CIT) - National Institutes of Health  (NIH), Department of Health and Human Service
Located in Bethesda, MD. This position requires:
• High-level vision, leadership, management, and modernization of CIT programs and services.
• Strategic direction and policy development for CIT long-term operations and objectives.
• Serve as a key IT advisor to the NIH Chief Information Officer.
A TOP SECRET security clearance will be required.  More job detail is found at:  http://www.jobs.nih.gov under the Executive Jobs section.Or contact Ms.Winnie Garner at seniorre@od.nih.gov.  Applications must be received ELECTRONICALLY by (11:59 p.m.), December 17, 2008.  DHHS and NIH are Equal Opportunity Employers

Bioinformatics Manager- Lilly Singapore Centre for Drug Discovery
For more information click here 

For reprints and/or copyright permission, please contact The YGS Group, 1808 Colonial Village Lane, Lancaster, PA;

(717) 399-1900 ext. 125, or via email to Ashley.Zander@theYGSgroup.com.