CLC bio Satisfies Next-Gen Bioinformatics Cravings



By Kevin Davies

September 21, 2009
| From humble beginnings in 2005, Danish software company CLC bio has emerged as one of the leading software providers for the exploding genomics and next-gen sequencing market. The Danes say they aim to be “among the most innovative bioinformatics companies in the 21st century.”

After earning his Ph.D. from the University of Florida, Bjarne Knudsen saw a business opportunity to improve the quality, efficiency, and user-friendliness of life science software. He returned to Denmark, recruited his brother Thomas (now CLC bio’s CEO) to the cause, hired a few local software developers, and CLC bio was born.

The origin of the name is a closely guarded secret. “‘Cake Loving Company’ is a good guess, but it’s incorrect,” says CLC bio North America CEO Jan Lomholdt.

CLC bio initially took a “Microsoft approach” to bioinformatics. “Everyone has to have it on their desktop, download it from the Internet. It was global thinking from the beginning,” says Lomholdt. The resulting Workbench suite, going to head to head with the likes of DNASTAR, Vector NTI, and Gene Codes, proved competitive. It was platform independent and “people could just right-click and get [GenBank] directly into their program,” says Lomholdt. By 2006, CLC bio had serviced 100,000 free downloads.

Adding more functionality, CLC bio attracted its first industry customers. With next-gen sequencing platforms emerging, Lomholdt and colleagues worked with the major vendors—Roche/454, Illumina, Applied Biosystems (AB), and Helicos—to ensure it could handle all types of sequence data.

Released in 2008 at the Bio-IT World Expo, the Genomics Workbench operates as a desktop application for next-gen sequencing analyses. Early this year, CLC bio added a three-tier server architecture with CLC Genomics Server, providing a server structure for CPU-intensive jobs, a database component, and the ability to use Genomics Workbench as a thick client and develop and execute customized plug-ins centrally.

Lomholdt says that the major next-gen platforms are “very good in hardware technology [but] they are not good in doing software—and they know it. Customers demand downstream analysis. When we said, ‘Hey, we can do this,’ they said, ‘Finally, there is one we can point to when the customer gets mad!’”

Lomholdt admits that CLC bio was a little late in supporting AB’s color space, but does now support SOLiD data. Moreover, the vendors recognize that many customers use multiple platforms. “We can be the one that merges their different datasets,” says Lomholdt. “We can handle long or short reads and merge them to get a higher quality result.”

Venter Validation
A coup was attracting the J. Craig Venter Institute (JCVI). “He [Venter] found out what we were doing, and said this is exactly what I need,” recalls Lomholdt. “We come from Denmark near LegoLand! We do bricks—build your own special plug-ins and increase the value of your program.”

Granger Sutton, JCVI’s senior director of informatics, said JCVI would implement the full enterprise platform as it integrated workflows “across different technologies and geographical sites.”

JCVI incorporated CLC bio’s accelerated versions of HMM algorithms into its pipeline for metagenomic annotations. According to JCVI director of bioinformatics software, Saul Kravitz, the single instruction, multiple data (SIMD)-accelerated tools increase analysis capacity and “take advantage of our annotation pipeline without any further hardware investments.”

One reason for CLC bio’s success is that it has consistently worked on algorithm optimization. Customers appreciate the improvements in RAM allocation allowing de novo assembly of a human genome on a single computer with 32 Gigabytes of RAM in 17 hours, as opposed to the massive RAM requirements by open source alternatives, says Lomholdt.

“We have an assembler that can challenge MAQ—higher coverage and speed, less RAM consumption,” says Lomholdt. Last June, CLC bio unveiled CLC Genomics Machine, bundling next-gen sequencing software with the hardware. “IT is a very important component. The bioinformatician is important, the scientist is important. Now we have a solution for all three.” 

Danish Delight
Among other notable clients for CLC bio is Saudi Biosciences. The sequencing of the first Arab genome was outsourced to BGI Shenzen, which transmitted the raw sequence data to CLC bio for the assembly as well as some “special analysis” for the Arabs (Lomholdt declined to elaborate). Head of bioinformatics, Ruiqiang Li, said Genomics Workbench is “simply in a league of its own when it comes to flexibility.”

Customers can obtain the software in many models—renting, leasing, owning, or site licenses. The Albert Einstein Epigenomics Center in New York is using the CLC portfolio for teaching, and the software used as an educational tool at Harvard and elsewhere. “One of our efforts is to help scientists get the Nobel Prize! Some say publications, I say Nobel Prize—why not?! To do that, you have to visualize the analysis.”

The CLC Genomics Server won a Best of Show award at the 2009 Bio-IT World Expo. University of Pittsburgh’s Michael Barmada called the CLC Genomics Server an ideal platform and said, “it’s nice to see complex computational algorithms and routines presented in a user-friendly environment with a very elegant interface.”

The company now has 50 staff, with bureaus in Singapore, Brazil, India, the U.K. “We think we’re in a very sweet position between the vendors, the customers, the HPC, the analysis, and the algorithm development,” says Lomholdt. 



This article also appeared in the September-October 2009 issue of Bio-IT World Magazine.
Subscriptions are free for qualifying individuals. Apply today.

Click here to login and leave a comment.  

0 Comments

Add Comment

Text Only 2000 character limit

Page 1 of 1



White Papers & Special Reports

sgi whp 2
Managing the Modern Genomics Data Flood
Sponsored by SGI

Managing and storing the perfect storm of multi-disciplined data pouring from next generation sequencers and other omics instruments is a central challenge in life sciences. Discover in this paper how the SGI ArcFiniti storage solution, optimized for unstructured genomics and life sciences data can: 

  • Reduce costs, proactively protect data integrity, and deliver the high performance I/O required for genomics data processing and analysis.  
  • Effectively manage capacities from 156TB to 1.4PB as a disk based, integrated hardware and software platform 


sgi - whp 1
Turning Genomics Data into Practical Insight
Sponsored by SGI

With worldwide sequencing capacity approaching 13 quadrillion DNA bases annually turning genomics data into knowledge is a true computational challenge. Read this paper and learn how the SGI UV coherent shared memory platform can:  

  • Speed results time while cost competitively tackling the most difficult computational problems across all omics disciplines. 
  • Push performance by scaling to extraordinary levels, up to 256 sockets (2,560 cores, 4,096 threads) per single system (one OS image). 

Provide support for up to 16TB of coherent shared memory in a single system image enabling extreme efficiency across a wide range of compute demands. 



accerlys-logo_2012_wh
New Complimentary Market Survey…
Collaborations and Communications Within Drug Discovery Research
Sponsored by Accelrys
This survey was conducted by the Cambridge Healthtech Media Group in January, 2012. It was sponsored by Accelrys related to their HEOS initiative to gather valid information around externalizing collaborative research while improving communications in the cloud. With 310 qualified industry respondents the survey findings reveal useful usage and trends patterns.  An insightful follow-on discussion and webinar related to this survey, and the HEOS by Scynexis SaaS portal is also available on the Bio-IT World website for complementary viewing.
 


Job Openings

tessella logo 
Scientific Software Engineer
Boston MA
$70,000 to $95,000
 

Tessella delivers software engineering and consulting services to leading pharmaceutical and biotech companies. We are recruiting Software Engineersto work with skilled bioinformaticians and scientists to identify business needs and recommend and develop technical solutions. Applicants require BS, MS or PhD in bioinformatics, biology or chemistry and 2+ years of software development in either: Java, C#, C++, C or VB.NET. 

Apply at http://jobs.tessella.com   

 

oxford nanopore logo 


 Early Access Collaborations Managers
Oxford Nanopore Technologies is developing a novel technology, GridIONTM for the direct, electronic analysis of DNA/RNA and other analytes.  As the system approaches the market, we are building a team of technically knowledgeable, highly motivated candidates with excellent customer service and facilitation skills to join our company as Collaboration Managers.  This is a unique opportunity to work with world-leading genomics customers throughout the early adoption phase of a new generation of DNA sequencing technology.. This is a facilitative, enabling role with responsibility for managing technology development collaborations with key customers at leading genomics institutions.  It will include long term management of the collaboration plan and milestones and associated meetings and documentation. Click here to find out more and apply   

Oxford Nanopore's GridION technology, VP, Sales and Marketing Oxford Nanopore Technologies is a fast-moving technology company that is developing a novel electronic molecular analysis technology. The technology is adaptable for the analysis of DNA/RNA, proteins, chemicals and other molecules.  It is therefore suitable for use in a variety of markets including scientific research and clinical applications.  As the technology approaches the market, Oxford Nanopore is seeking a visionary VP of sales and marketing to join the senior team.  The candidate will embrace the opportunities afforded by entering the market with a truly disruptive technology that has the potential to expand the number of users and the variety of applications in each target market.  This is a rare opportunity to influence the commercial strategy at an early phase of its commercial lifetime, in a well funded company.  Oxford Nanopore welcomes applications from candidates with a track record of high-level strategic commercial  leadership, who wish to apply a fresh approach to existing markets.  Experience in Life Sciences/DNA sequencing is central to this role, however we will consider your application if you have experience of disruptive technologies in other related industries.  We are particularly interested in candidates with strong expertise in the use of digital technologies for sales and marketing of scientific/technical products.  Click to  Apply  


 

For reprints and/or copyright permission, please contact  Tim McLucas, (781) 972-1342, tmclucas@healthtech.com .