The Genomic Ark: Collecting the Genomes of 10,000 Species



Loading...

By Allison Proffitt

November 4, 2009
| SINGAPORE—An international team of scientists aims to sequence the genomes of 10,000 vertebrate species, male and female, in a project that could affect every aspect of biology.

Known as the Genome 10K Project, the project was conceived by David Haussler, professor of biomolecular engineering at UC Santa Cruz and a Howard Hughes Medical Institute investigator; Stephen J. O'Brien, chief of the Laboratory of Genomic Diversity at the National Cancer Institute; and Oliver A. Ryder, director of genetics at the San Diego Zoo's Institute for Conservation Research and adjunct professor of biology at UC San Diego.

In April, 55 scientists representing major zoos, museums, research centers and universities around the world hashed out the details. Today almost 70 scientists worldwide are involved and the specifics of the proposal will be published in the Journal of Heredity.

The plan is simple. Gather tissue and DNA specimens from living mammals, birds, reptiles, amphibians, and fishes, and some recently extinct species. When possible, gather data from both males and females, and reflect geographic diversity within a species. Then sequence all of it.

It is a “bold proposal,” says Byrappa Venkatesh, head of the Comparative Genomics Laboratory at Singapore’s Institute of Molecular and Cell Biology and the chairman of the project’s “fish committee.” “The first bottleneck was to identify the species and get hold of tissue samples,” Venkatesh tells Bio-IT World. “We had a meeting in April 2009… and now we’ve cleared that first hurdle. We have more than 10,000 species identified.”

In fact, the group has more than 16,000 proposed species in their database after the April meeting. Venkatesh’s fish committee proposed 4000 species of fish, because fish make up 50% of living vertebrates. The collected samples are housed with more than 50 institutions all over the world.

The results will change the field of biology. The data will lay a foundation for understanding the genetic basis of recent changes within vertebrate species and between closely related species. Results will be analyzed to reveal evolutionary changes and help predict how species will respond to climate change, pollution, emerging diseases, and invasive competitors.

“We are capturing what evolution left us with before the human population started impacting the species—a set of genomes inclusive of the biota that a magnificent evolutionary process has produced,” said lead author Stephen O’Brien.

Now the group, calling themselves the Genome 10K Community of Scientists (G10KCOS), needs to raise money. Funding will hopefully come from the NIH and other agencies, foundations, and conservation groups around the world.

And of course there’s the technology. The Genome 10K committee is counting on the cost of sequencing to fall below $5000. At that price, 10,000 genomes will be possible, but it’s not there yet.

“It was the same when they started the Human Genome Project,” Venkatesh says. “The technology developed in parallel with the project and this will be the same.” Venkatesh says that the group doesn’t have a clear timeline yet, but, “it should take about five years once we have the money and start distributing the samples.”

For Venkatesh, the project has been a long time coming. He has had recent success sequencing the fugu (or pufferfish) and elephant shark genomes, but he’s been hoping for something on a much larger scale. “I’ve been collecting fish samples for the last 16 years,” he says. “I had a list of fishes I wanted to sequence, but we didn’t have the funds. This is what I was hoping for.”

Click here to login and leave a comment.  

0 Comments

Add Comment

Text Only 2000 character limit

Page 1 of 1

White Papers & Special Reports

Quantum
StorNext 4.0: Technical Product Brief
Sponsored by Quantum

 
Proven in the world’s most data intensive industries, Quantum StorNext is a scalable, high-performance file system which allows data sharing across Linux, Mac, Unix, and Windows operating systems and manages data in enterprise storage environments. In this Technical Brief you'll learn:

  • How a high-performing file system can accelerate your business
  • How to simplify your data management
  • How a tiered storage approach can save you money


SURETY-IP_WPx108
Protect Your Scientific Intellectual Property: Proof of Lab Informatics Data Authenticity is Your Best Legal Defense
Sponsored by Surety, LLC

As a bio-technology or life sciences organization, your formulas, treatments and research and discoveries are the “lifeblood” of your business. But if you aren't protecting the integrity of your scientific data in your lab informatics systems, you risk losing IP ownership, revenue and consequently your business if you can't prove time-of-creation and data authenticity. Learn how you can implement simple, cost-effective and automated controls to protect your scientific intellectual property. Consider:

  • IP protection requirements in bio-pharma and other science-oriented industries can extend out 20, 30, 40 or more years
  • Most electronic lab management solutions include generic authenticity controls, so how "legally defensible" is yours?
  • Only standards-compliant, independent controls can future-proof your approach to long-term IP integrity protection and authenticity.
  • Learn more - get the free whitepaper now


BlueArc_WP_DataMigration.jpg
The Key to Life Sciences Data Management: Transparent Migration
Sponsored by BlueArc

Life sciences organizations face new data management challenges as the volume of research data grows and more data is kept online for longer times. Read this paper to learn about:

  • The benefits of transparent data migration (TDM)
  • How TDM technologies can simplify data management.
  • How using TDM can help increase storage utilization, improve computational workflow performance, and optimize the use of storage resources.


Life Science Webcasts & Podcasts

adobe_i3_btn_webinarNext-Generation Clinical Trial and Data Management Applications
Sponsored by Adobe

This webinar introduces i3Cube - a web-based, fully integrated, clinical trial and data management system built on Adobe’s LiveCycle® Enterprise Suite.  I3 cube provides end-to-end automation that delivers unprecedented visibility into information that sponsors need to accelerate the study process and complete trials efficiently. Viewers will learn more about:

  • Creating faster and more efficient trial processes
  • Reducing investigator burden 
  • Real-time sponsor transparency into study information
  • Enterprise solutions based on Adobe LiveCycle® ES utilizing cross-platform clients of Reader, Flash and AIR

    Download now.



More Podcasts

Job Openings

Employers -- Don't miss this opportunity to reach well-qualified life science candidates.

Loading...

For reprints and/or copyright permission, please contact The YGS Group, 3650 West Market Street, York, PA;

(717) 505-9701 ext. 125, or via email to Ashley.Zander@theYGSgroup.com.