The Human Metabolome Project


By Kevin Davies

April 12, 2007 | This year, the journal Nucleic Acids Research published its 14th annual molecular biology database issue. The 2007 compendium included 174 databases, including 106 new entries and 68 updates. This brings the total number of databases in the NAR online Molecular Biology Database Collection to 968. That’s an impressive increase of 110 on the previous year.

Although each of these repositories has its diehard aficionados, one entrant has come in for some unusually heavy scrutiny lately. The Human Metabolome Database (HMDB) is the work of a 40-member all-Canadian team, led by David Wishart, of the department of computing science at the University of Alberta, Edmonton.

Motivated by the absence of a metabolomic equivalent of GenBank that could provide information and possibly even samples of metabolites, researchers secured $7.5 million funding from Genome Canada in 2005 for the “Human Metabolome Project.” Their goal is, “to improve disease identification, prognosis, and monitoring; provide insight into drug metabolism and toxicology; provide a linkage between the human metabolome and the human genome; and develop software tools for metabolomics.”

Metabolomics is the effort to identify and catalogue the thousands of metabolites found in human blood and urine, as well as other organisms. A full metabolite catalogue would improve efforts to understand the production of metabolic biomarkers, the course of disease progression, and the metabolic actions and toxicity of new drugs in preclinical research.

According to Wishart and colleagues, HMDB is “the most complete and comprehensive curated collection of human metabolite and human metabolism data in the world.” The database contains records for more than 2,500 human metabolites culled from “thousands of books, journal articles, and electronic databases,” and the Canadian team estimates the final tally will be more than double the current number. There are typically dozens of data fields, including synonyms, structural and physico-chemical data, NMR and MS spectra, disease associations, pathway information, sequence and SNP data, and external links. The HMDB is available at: www.hmdb.ca

“Fundamentally,” Wishart and colleagues write, “HMDB is a multi-purpose bioinformatics-cheminformatics-medical informatics database with a strong focus on quantitative, analytic or molecular-scale information about metabolites, their associated enzymes or transporters and their disease-related properties. HMDB combines the data-rich molecular biology content normally found in curated sequence databases such as SwissProt and UniProt with the equally rich data found in KEGG (about metabolism) and OMMBID (about clinical conditions).”

 But in London, Imperial College’s Jeremy Nicholson, the founding father of metabolomics, says the significance of the HMDB has been overblown. “It is just a database of compounds that were mostly known to be involved in human metabolism,” he says. “The project did not address the key issue — the variance of metabolites between tissues, cells, and biofluids. That is the important thing which classifies diseases and phenotypes.”

Last year, Nicholson’s team collaborated with scientists at Pfizer to publish a paper in Nature on “Pharmaco-metabonomic phenotyping and personalized drug treatment.” The goal is to to predict drug-related outcomes in humans by measuring metabolic signatures in body fluids. Indeed, Nicholson notes that humans vary considerably in their metabolic profile according to a host of factors, including age, sex, time of day, time of month (in women), diet, gut flora, pollutant exposure, ethnicity, fitness, and more. Nicholson acknowledges that the HMDB is “a useful catalogue,” but of limited novelty, he says. For him, HMDB “only forms the index for the book of the human metabolome — not the text.”

Wishart professes the utmost respect for Nicholson, but says criticism of HMDB as a mere list is unfair. "It would be like saying the Encyclopedia Brittanica is just a list or that GenBank is just a list," he says. His group has released a new database called FooDB (food components and additives), which complements HMDB and the DrugBank database. "These databases, if printed off, would be 100,000 pages long. They contain an enormous amount of biological, chemical, clinical, biochemical data." He adds, "We've had to upgrade our servers to deal with the heavy load," of some 200,000 hits per month.

See related article: The Human Metabolome Contretemps

Subscribe to Bio-IT World  magazine.

 

Click here to login and leave a comment.  

0 Comments

Add Comment

Text Only 2000 character limit

Page 1 of 1

White Papers & Special Reports

definiens briefingon-76Next-Generation Technologies Revolutionizing Oncology and Diagnostics
underwritten by Definiens

This “Briefing On” collection of Bio-IT World features, commentaries and analysis, presents some of the latest thinking on high-throughput technologies that are being applied to the fields of research and drug discovery, with particular emphasis on oncology, diagnostics and imaging technologies. Download now at no charge compliments of the underwriting sponsor, Definiens. Download This Free Paper



gq nxt gen seq

This Bio•IT World Briefing On “Next-Generation Sequencing,” underwritten by GenomeQuest, Inc.,
presents a selection of feature stories, interviews,commentaries, conference reports, and editorials on the emergence, opportunities, and challenges posed by high-throughput sequencing. Covered in this collection: the launch of new platforms from Applied Biosystems and Helicos; new applications of nextgen sequencing; the rise of personal genomics; and informatics solutions to vexing problem of managing the vast volumes of next-gen data.  Download now 



Life Science Webcasts & Podcasts

GenoLogicsgenologics 2 translational
Enabling Translational Research Informatics

Learn about the challenges facing life sciences research labs to manage their translational research data:

  • The trends for organizations to adopt informatics solutions for translational research.
  • The unique requirements with managing complex data and workflow.
  • What labs should consider when reviewing informatics solutions for translational research.
  • Which life sciences research organizations are successfully adopting an informatics solution.

Download Now



More Podcasts

Job Openings

Assistant Editor (Science Writer)~Cambridge Healthtech Institute (CHI), Needham, MA, 
Cambridge Healthtech Institute seeks an assistant editor (science writer) who is an ambitious, dependable journalist who can fulfill a range of writing and editorial duties for a series of eNewsletters covering various aspects of the biopharmaceutical industry in addition to CHI’s flagship publication, Bio-IT World magazine.  This is a superb opportunity to make important contributions to the growth and success of a multimedia science publishing group, while gaining invaluable experience in multiple facets of the publishing industry.   Interested candidates should submit a cover letter, including 3 writing samples (attached in Word or PDF format), salary history or requirements, and resume to kdavies@healthtech.com. 

Fred Hutchinson Cancer Research Center: IT Business Analyst III
The Hutchinson Center is the only National Cancer Institute-designated comprehensive cancer center in the Pacific Northwest. Through our Tumor Research Initiative, we are finding new ways to detect tumors at an early stage.  We are presently seeking an experienced IT Business Analyst to assess technology needs for the Tumor Research Initiative, and to identify and design improvements to computer based systems.  For more information please visit www.fhcrc.org and search for Job# AD-21465

For reprints and/or copyright permission, please contact RMS, 1808 Colonial Village Lane, Lancaster, PA;

(717) 399-1900 ext 100 or via email to bio-itworld@theygsgroup.com.