Complete Genomics Targets ‘The First $1000 Genome’



Sequencing service is on track for Spring 2009.

By Kevin Davies

Nov. 12, 2008 | Complete Genomics emerged from stealth mode in early October brandishing an audacious service model for wholesale next-generation sequencing, with its first human genome already assembled and the CEO’s pledge to reach the magical “$1000 genome” price point as early as spring 2009.

“Our mission is to be the global leader in complete human genome sequencing,” chairman, president, and CEO Clifford Reid told Bio-IT World. “We are setting out to completely change the economics of genome sequencing so that we can do diagnostic quality human genome sequencing at a medically affordable price. Essentially, [we’ll] transition this genome sequencing world from a scientific and academic endeavor into a pharmaceutical and medical endeavor.”

Based in Mountain View, Calif., Complete Genomics has raised $46 million in three rounds of financing since its incorporation in 2006. The company will not sell individual instruments; instead, it will offer a service aimed initially at big pharma and major genome institutes. It is building what Reid calls “the world’s largest complete human genome sequencing center so we can sequence thousands of complete human genomes, so that researchers can conduct clinical trial-sized studies.”

If all goes according to plan, that 32,000-square-foot, $75 million facility will deliver 1,000 human genomes in 2009 and an eye-popping 20,000 genomes in 2010. But that’s just the beginning. The firm plans to build ten genome centers in the U.S. and abroad over the next five years in partnership with various organizations and foreign governments.

The $4000 Genome
Although the data are unpublished (advisory board member Leroy Hood admitted he hadn’t seen the genome assembly results at the time of launch), Reid says his team produced its first human genome sequence last July. “It’s getting a bit long in the tooth already,” he jokes. “The total materials cost was about $4000.” The genome coverage was 22-fold from a total of 67 gigabases (Gb) of mapped reads. “The speed of the instrument is about ten times as fast as ABI and Illumina,” Reid claims. “This [project] ran four instruments for one run of a week. This is a 28-instrument-day experiment. By the launch of our product in Q2 [of 2009], it will be a 4-instrument-day experiment.”
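The figures Reid quotes can be checked with simple arithmetic. The sketch below is illustrative only: the haploid human genome size of roughly 3.1 Gb is an assumption that the article does not state, and the variable names are hypothetical.

```python
# Back-of-the-envelope check of the numbers quoted above.
HUMAN_GENOME_GB = 3.1        # assumption: approximate haploid genome size, not from the article
mapped_reads_gb = 67         # from the article: total mapped reads
instruments = 4              # from the article: four instruments
run_days = 7                 # from the article: one run of a week
target_instrument_days = 4   # from the article: goal at product launch

coverage = mapped_reads_gb / HUMAN_GENOME_GB
instrument_days = instruments * run_days
throughput_gain_needed = instrument_days / target_instrument_days

print(f"coverage: about {coverage:.0f}-fold")
print(f"instrument-days for the first genome: {instrument_days}")
print(f"throughput gain needed by launch: {throughput_gain_needed:.0f}x")
```

Under that assumption, 67 Gb of mapped reads works out to roughly 22-fold coverage, and moving from 28 to 4 instrument-days implies about a sevenfold gain in per-instrument throughput.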

When the product is launched next spring, “we fully anticipate the materials cost of that genome will be just under $1000,” Reid says. “We’re going to price the genomes at $5000 each, which covers of course not only material but also instruments and labor and overhead.” Reid admits it’s an incomplete measure of cost, but it has become the industry’s standard accounting method. By those criteria, “It will be the first $1000 genome,” says Reid.

The Complete Genomics technology is based on co-founder Radoje Drmanac’s work in sequencing-by-hybridization using a ligation strategy and gridded arrays of up to one billion DNA “nanoballs.” Reid’s expertise in computer science and Drmanac’s in biochemistry proved to be a perfect complement. “The convergence of biotechnology and computing has really enabled a whole new generation of DNA sequencing that’s going to change the world,” he says.

First Service
Complete Genomics is not the first company to explore a service model for genome sequencing (see “Genome Corp. Born Again,” Bio-IT World, Jan. 2008), but it is the first to do so using a next-gen sequencing technology.

Reid cites two main reasons for choosing a services model. First, he sees the key market as large pharma conducting clinical trials. “The pharmaceutical market has declared very clearly they don’t want to buy instruments,” says Reid. “They want to buy services, so that they get the data that enables them to do the discovery and development work, rather than have to own and operate a large-scale genome sequencing center.”

Petabytes of Data

The Complete Genomics platform hinges on exquisite precision in manufacturing and arraying “nanoballs” of DNA. But it will be critical to manage gargantuan quantities of data. The task of building the data center falls to vice president of software Bruce Martin, a former executive with Sun and Openwave.

“I’ve built a team that is a little microcosm of what you see in the rest of the company,” says Martin, including bioinformaticians who worked with Craig Venter on genome assembly and the HapMap project, as well as experts in data mining, indexing databases, and high-throughput computing.

The imaging steps involve measuring hundreds of millions of spots. “We are currently generating close to a gigabit a second off the imager, and that’s going to go up by a substantial amount in the next year,” says Martin. “I have not only an extremely interesting computational challenge here, but there’s just a bandwidth problem… You can’t store images at that rate onto disk drives without spending a king’s ransom in storage.”
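To put Martin's bandwidth figure in perspective, the following sketch converts a sustained one gigabit per second into daily and weekly volumes. It assumes a single imager running continuously at exactly 1 Gb/s and uses decimal units; neither assumption comes from the article, and the names are hypothetical.

```python
# Rough conversion of a sustained imager data rate into storage volume.
rate_bits_per_s = 1e9            # "close to a gigabit a second", taken as exactly 1 Gb/s (assumption)
SECONDS_PER_DAY = 24 * 60 * 60

bytes_per_s = rate_bits_per_s / 8
tb_per_day = bytes_per_s * SECONDS_PER_DAY / 1e12   # decimal terabytes
tb_per_week = tb_per_day * 7                        # one week-long run, continuous imaging (assumption)

print(f"{bytes_per_s / 1e6:.0f} MB/s")
print(f"{tb_per_day:.1f} TB per day")
print(f"{tb_per_week:.1f} TB per week-long run")
```

At that rate, raw images from one imager would accumulate at about 10.8 TB per day, which illustrates why storing every image on disk is costly.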

Martin says his group has had “a very successful run” with a clustered storage system from Isilon, which he likes for its “very high performance” and ability to scale to multi-petabyte file systems. “You can manage it with a very small footprint of staff. The Broad recently deployed them as well. I couldn’t say who got there first. We both basically have selected them for similar reasons.”

Due to space, power, and cooling considerations, Martin is exploring options with several high-density blade vendors. “We want to pack as many cores and as much memory into as small a footprint as we can for economic reasons,” he says.

Martin says he’s made “a significant investment in an aligner” for rapid genome alignments that can scale to thousands of processors. “I went out and found some very significant expertise in Silicon Valley in terms of high-speed, large-scale search and indexing. We have many of the leading companies in the world in that area.”

If the ramp up for 2009 sounds daunting—1,000 genomes in a center housing 5 petabytes of data—the specs for sequencing 20,000 genomes in 2010 are positively frightening. “We’ll probably be in the 60,000-processor and 30-petabyte range in that time frame,” says Martin.
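A crude way to read these specs is to divide each year's planned capacity by its genome target. The sketch below does that, assuming (and the article does not say this) that all storage and processors are attributable to the genomes produced in that year; the names are hypothetical.

```python
# Naive per-genome ratios from the quoted capacity plans.
plans = {
    2009: {"genomes": 1_000, "petabytes": 5},
    2010: {"genomes": 20_000, "petabytes": 30, "processors": 60_000},
}

for year, plan in plans.items():
    # Decimal units: 1 PB = 1,000 TB
    tb_per_genome = plan["petabytes"] * 1_000 / plan["genomes"]
    print(f"{year}: about {tb_per_genome:.1f} TB of storage per genome")

processors_per_genome_2010 = plans[2010]["processors"] / plans[2010]["genomes"]
print(f"2010: about {processors_per_genome_2010:.0f} processors per genome sequenced that year")
```

Read this way, the plan implies that storage per genome falls from about 5 TB to about 1.5 TB between 2009 and 2010, though the real allocation may differ.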

A second consideration is that the new sequencing technologies “generate a breathtaking amount of data,” says Reid. “Simply selling 10 or 20 instruments to a company doesn’t solve the problem. You then have to be able to manage huge volumes of data. We are putting in a Google-style data center to manage the data.” (See sidebar: “Petabytes of Data”)

Reid plans to build a further ten centers—for about $50 million apiece—in the U.S. and abroad over the next five years, in partnership with other companies, research organizations, and countries. Those ten genome centers will produce about one million genomes over the next five years. “A nice way to think about 1 million genomes is 1,000 people with each of 1,000 diseases. By the time we’ve done that, we will understand the genetic basis of all the important human diseases,” he says.

Scale Up
The near-term goal for 2009 is to focus on pilot sequencing projects for the commercial and academic communities to validate the technology and establish workflows that will form the operational blueprint for expansion. Ten percent of the firm’s sequencing capacity in the next two years will be devoted to a collaboration with advisory board member Hood. The Institute for Systems Biology president is a partner with the Government of Luxembourg on a $200-million biobank and personalized medicine project.

As for targeting pharmaceutical companies, Reid predicts two key groups of early adopters—companies pursuing cancer and mental illness. Both groups of diseases have a strong genetic component. Says Reid: “To date, the industry has not been able to find the rare variants that are causes of diseases and drug response. That’s a new capability we’re bringing.”

Another enticing constituency for Complete Genomics is the personal genomics or consumer genomics market. Reid agrees: “Knome and 23andMe and Navigenics and all those guys will essentially buy genome services from us and add a lot of value [and] transfer it on to the consumer population.” 

Editor’s Note: Complete Genomics CEO/Chairman Clifford Reid will be a keynote speaker at the 2009 Bio-IT World Expo (April 27-29, 2009).  

___________________________________________________ 

This article appeared in Bio-IT World Magazine.
