July 20, 2008
| Bio-IT World > Pack It In
Pack It In


Oct. 10, 2007 | One way to address the data management issue is to store data more efficiently so that it takes up less space and is easier to query. That is the general idea behind a new database from start-up Vertica.

The company was founded by life sciences veteran Andy Palmer and database veteran Michael Stonebraker. Palmer was most recently CIO and senior vice president at Infinity Pharmaceuticals. He also served as president of the Interoperable Informatics Infrastructure Consortium (I3C). Stonebraker was the main architect of the INGRES relational DBMS, and the object-relational DBMS, POSTGRES.

Most databases are optimized to handle a large number of updates. The Vertica Database is a general-purpose relational database system designed to provide extremely good performance on read-intensive query workloads.

“In many [industries], there are applications and uses of database technology where people spend much more time reading rather than writing to a database,” said Palmer. “I figured there was an opportunity to build from scratch an SQL database for read-only mode.”

The database organizes data on disk as columns of values from the same attribute, as opposed to storing it as rows of tabular records. This means that when a query needs to access only a few columns of a particular table, only those columns need to be read from disk. Conversely, in a row-oriented database, all values in a table are typically read from disk, which wastes I/O bandwidth.

Storing data in the column-oriented manner improves performance. “Because of the way the data is represented, queries can be completed in reasonable times,” said Palmer.

The Vertica Database also uses aggressive compression of data on disk, as well as a query execution engine that is able to keep data compressed while it is operated on. “Because of [the] significant compression, [it] is much more efficient allowing you to keep more data,” said Palmer.

According to Vertica, these technologies help execute queries much faster than traditional relational database management systems and require significantly less storage space.

Palmer notes that the technology is well suited to life sciences applications such as those that tag data using the World Wide Web Consortium’s Resource Description Framework (RDF). -- S.S.


Return to main article.

Click here to login and leave a comment.  

0 Comments

Add Comment

Text Only 2000 character limit

Page 1 of 1

White Papers & Special Reports

definiens briefingon-76Next-Generation Technologies Revolutionizing Oncology and Diagnostics
underwritten by Definiens

This “Briefing On” collection of Bio-IT World features, commentaries and analysis, presents some of the latest thinking on high-throughput technologies that are being applied to the fields of research and drug discovery, with particular emphasis on oncology, diagnostics and imaging technologies. Download now at no charge compliments of the underwriting sponsor, Definiens. Download This Free Paper



gq nxt gen seq

This Bio•IT World Briefing On “Next-Generation Sequencing,” underwritten by GenomeQuest, Inc.,
presents a selection of feature stories, interviews,commentaries, conference reports, and editorials on the emergence, opportunities, and challenges posed by high-throughput sequencing. Covered in this collection: the launch of new platforms from Applied Biosystems and Helicos; new applications of nextgen sequencing; the rise of personal genomics; and informatics solutions to vexing problem of managing the vast volumes of next-gen data.  Download now 



Life Science Webcasts & Podcasts

GenoLogicsgenologics 2 translational
Enabling Translational Research Informatics

Learn about the challenges facing life sciences research labs to manage their translational research data:

  • The trends for organizations to adopt informatics solutions for translational research.
  • The unique requirements with managing complex data and workflow.
  • What labs should consider when reviewing informatics solutions for translational research.
  • Which life sciences research organizations are successfully adopting an informatics solution.

Download Now



More Podcasts

Job Openings

Assistant Editor (Science Writer)~Cambridge Healthtech Institute (CHI), Needham, MA, 
Cambridge Healthtech Institute seeks an assistant editor (science writer) who is an ambitious, dependable journalist who can fulfill a range of writing and editorial duties for a series of eNewsletters covering various aspects of the biopharmaceutical industry in addition to CHI’s flagship publication, Bio-IT World magazine.  This is a superb opportunity to make important contributions to the growth and success of a multimedia science publishing group, while gaining invaluable experience in multiple facets of the publishing industry.   Interested candidates should submit a cover letter, including 3 writing samples (attached in Word or PDF format), salary history or requirements, and resume to kdavies@healthtech.com. 

Fred Hutchinson Cancer Research Center: IT Business Analyst III
The Hutchinson Center is the only National Cancer Institute-designated comprehensive cancer center in the Pacific Northwest. Through our Tumor Research Initiative, we are finding new ways to detect tumors at an early stage.  We are presently seeking an experienced IT Business Analyst to assess technology needs for the Tumor Research Initiative, and to identify and design improvements to computer based systems.  For more information please visit www.fhcrc.org and search for Job# AD-21465

For reprints and/or copyright permission, please contact RMS, 1808 Colonial Village Lane, Lancaster, PA;

(717) 399-1900 ext 100 or via email to bio-itworld@theygsgroup.com.