The 2005 Database Explosion

By BIO-IT World

RESEARCH TOOLS · More than 700 'open' databases now available from around the world

BY KEVIN DAVIES

February 15, 2005 | The 2005 compendium of molecular biology databases compiled and published by Nucleic Acids Research shows a dramatic increase of 171 databases from 2004, bringing the new total up to 719.

The compendium, which is restricted to freely available life science databases that do not require downloading of special software (because of firewall restrictions), is accessible online at nar.oupjournals.org/ cgi/content/full/33/suppl_1/#TBL1.

Michael Galperin, an investigator at the National Center for Biotechnology Information at the NIH who coordinated the new compendium, says the compendium shows that "the open database movement is here to stay, and more and more people in the community (as well as in the financing bodies) now appreciate the importance of open databases in spreading knowledge."

This year's 12th annual database issue includes 719 databases, organized into 14 categories. The list includes new entries from countries such as Brazil, Cuba, Estonia, Greece, Hungary, and Malaysia.

Of the 548 databases featured in last year's compilation, 17 have been dropped from the list because they have been discontinued, merged into larger ones, or converted to commercial access. Galperin says that databases that offer valuable content "usually manage to survive, even if they have to change their funding scheme or migrate from one host institution to another."

The profusion of databases has little to do with personal credit, and certainly not financial remuneration. "Disk space is relatively cheap these days and database maintenance tools are fairly straightforward, so that a decent database can be created on a shoestring budget, often by a graduate student or as a result of a postdoctoral project," Galperin says.

While getting such projects off the ground is easy, maintaining and developing these resources with little or no funding requires "a commitment that can only be applauded" — particularly for scientists for whom English is not their native language.

The volume of immunology-related databases required the creation of a new category, largely in response to the genome project-fueled growth in data on immuno-polymorphisms, as well as a sprouting of plant-related genome databases.

Galperin's accompanying commentary is available at nar.oupjournals.org/cgi/ content/full/33/suppl_1/D5.* 






White Papers & Special Reports

thomson reuters image
Biomarkers: An Indispensible Addition to the Drug Development Toolkit
Examining the Potential of Biomarkers
Sponsored by Thomson Reuters

Biomarkers are becoming an essential part of clinical development. In this white paper, Thomson Reuters provides insight from experts in industry and academia, and explores the role of biomarkers as evaluative tools in improving clinical research and the challenges this presents.

Discover the potential of biomarkers to:

  • Improve decision making
  • Accelerate drug development
  • Reduce development costs


BlueArc_Scientific Data
Scientific Data Lifecycle Management: Preparing for Storage in an Uncertain Future
Sponsored by BlueArc

Managing vast and overwhelming streams of gene sequencing data today requires ultra-high performance systems and processes. With continued rapid advancement and improvements in gene sequencing, expect tomorrow’s instruments to output quantities of genomic information that will dwarf current levels. Help your organization maintain data control and prepare for the future of sequencing through this informative paper that discusses:

  • The information technology challenges of gene sequencing
  • “Intelligent” methods for data management and customization
  • System survival tips... Deciding what data to keep or delete
  • New tools to keep scientists ahead of impending data torrents


SAS Managed image
Managed Innovation, Assured Compliance
Developing, executing and managing the transformation, analysis and submission of clinical research data with SAS® Drug Development
Sponsored by SAS
Get better products to market faster. Download this white paper to discover the top ten challenges facing life science executives and how to overcome them. See how SAS Drug Development transforms clinical data into true innovation.


Life Science Webcasts & Podcasts

Presented by Trade Commission of Spain

Spain Biotech: An Engine for Economic Change 

TCS podcastDiscover how Spain is focusing on biotechnology to be an engine for economic change through gradual internationalization, development and technology transfer.

Regional governments are actively investing in public and private biology research and promoting the creation of knowledge-based companies. Spain’s human capital combined with aggressive investment in biotech research and infrastructure has led to the creation of bio-clusters.

Today, there are nearly 700 Spanish companies engaged in biotechnology, with almost 50 percent growth in funding devoted to research. In fact, spending on internal R & D in biotechnology has grown 46 percent and is close to 300 million Euros.

Access the podcast 

 



More Podcasts

Job Openings

saic_logo

MANAGER, SCIENTIFIC COMPUTING & PROGRAMMING
(Bioinformatics Manager)
SAIC-Frederick, Inc has an exciting opportunity for a Manager, Scientific Computing & Programming - Core Genoytyping Facility in Gaithersburg, Maryland.  In this role, you will lead the Bioinformatics & Analysis Group.
Master’s or equivalent required.  PhD preferred. Six years experience in development of scientific programs in high-performance computing environment including five years supporting scientific research in computational chemistry, biology, or genetics, & two years supervisory experience.  View complete job posting & apply: www.saic-frederick.com. Position #146945.

For reprints and/or copyright permission, please contact The YGS Group, 1808 Colonial Village Lane, Lancaster, PA;

(717) 399-1900 ext. 125, or via email to Ashley.Zander@theYGSgroup.com.