August 30, 2008
| Bio-IT World > Sequence Sudoku
Sequence Sudoku


May 15, 2007 | The basis of Applied Biosystem’s new SOLiD platform differs from traditional Sanger sequencing in several respects. First and foremost, it uses DNA ligases to anneal fluorescently tagged DNA cassettes to the template, rather than polymerases. But the ingenious twist is the way that sequence is actually interrogated and read out.

Each SOLiD probe interrogates two adjacent bases at a time, not a single base. But as it uses only four fluorescent tags, each color tag matches four possible dinucleotide pairings (see panel A). For example, blue corresponds to AA, CC, TT, or GG. Thus, any color by itself cannot reveal the identity of a particular base. But once the identity of the first base is known, then the rest fall neatly into position.

Sudoku-A(x360).jpg
Click for a larger version

For example, if the first base is an A, and the first color is blue, then the first two bases must be A-A (because the other possibilities are by definition excluded). Now lets turn to the second color — green. Since we just deduced that the second base in the sequence is an A, we deduce the identity of the third base by reading out which green dinucleotide pair starts with an A. Thus, the third base is C. This process continues for the read length of 25 bases (see panel B).

Sudoku-B(x360).jpg

Although the individual read lengths are very small, the system offers some significant advantages. The system neatly distinguishes between naturally occurring variants (SNPs) and random sequencing errors. In most sequencing systems, there is no way to discern a genuine SNP versus a sequence error. But in SOLiD, a random sequence error will only change a single colored tag. A genuine SNP, by contrast, is revealed as changes in consecutive color tags, because the altered base occurs in two successive nucleotide pairings.  -- K.D.

Return to main article.


Click here to login and leave a comment.  

0 Comments

Add Comment

Text Only 2000 character limit

Page 1 of 1

White Papers & Special Reports

isilon white paper

“Storage for Science – Methods for Managing Large and Rapidly Growing Data Stores in Life Science Research Environments” sponsored by Isilon
Large and rapidly growing stores of file-based and other data are a hallmark of life science research and bioinformatics. Determining how best to manage those data stores has become a significant challenge for Researchers and IT Pros alike.

This paper is intended to:

  • Provide guidance on the many storage requirements common to Life Science research;
  • Explain the evolution of modern storage architectures;
  • Summarize the major data storage architectures currently in use.

Additionally, it will present the Isilon IQ clustered storage product as a strong and flexible solution to those needs. Download now



definiens briefingon-76Next-Generation Technologies Revolutionizing Oncology and Diagnostics
underwritten by Definiens

This “Briefing On” collection of Bio-IT World features, commentaries and analysis, presents some of the latest thinking on high-throughput technologies that are being applied to the fields of research and drug discovery, with particular emphasis on oncology, diagnostics and imaging technologies. Download now at no charge compliments of the underwriting sponsor, Definiens. Download This Free Paper



metaminer image(1)

MetaMiner™ Cystic Fibrosis Report,  Sponsored by GeneGo
This paper discusses the MetaMiner™ (CF) data analysis platform for a broad range of CF researchers designed to: 1. Easily assemble important biological and chemical experimental data available today in cystic fibrosis research. 2. Visualize key mechanisms leading to the disease through pathway maps and network models 3. Provide the CF community a “one stop shop” tool for uploading and analyzing experimental data in a disease-centered interface.  Download now 



Life Science Webcasts & Podcasts

Storage for Science
Methods for Managing Large and Rapidly Growing Data Stores in Life Science Research Environments

Sponsored by Isilon

Isilon webcast1

Large and rapidly growing stores of file-based and other data are a hallmark of life science research and bioinformatics environments. Determining how best to manage those data stores has become a significant challenge for the Researchers and IT Professionals that support them.

This webcast is intended to: 

  • Provide guidance on the many storage requirements common to Life Science research; 
  • Explain the evolution of modern data storage architectures; 
  • Summarize the major data storage architectures currently in use;
  • Present the Isilon IQ clustered storage product as a strong and flexible solution to those needs.

    Download this webcast

More Podcasts

Job Openings

Isilon Systems ~ Senior Marketing Communications Manager
Isilon Systems is the worldwide leader in clustered storage systems and software for digital content and unstructured data. We seek an experienced marketing communications professional/writer expert in creating and delivering effective and persuasive business communications. The ideal candidate can think at the strategic and conceptual level and act, simultaneously, as a highly-effective and productive individual contributor. The position is based in Seattle, WA. For additional information click here:
 

Lilly Singapore Center for Drug Discovery (LSCDD) - Associate Director of Informatics
Lead and mentor a strong team for the Bioinformatics group at the Integrative Computational Sciences (ICS) department at LSCDD towards the development of novel algorithms, data analysis methods and software tools for drug discovery. Work closely with the Software Engineering group at ICS, and collaborate with the Discovery IT organization in Europe and USA. For additional information, or to apply visit: LSCDD 

For reprints and/or copyright permission, please contact RMS, 1808 Colonial Village Lane, Lancaster, PA;

(717) 399-1900 ext. 125 or via email to bio-itworld@theygsgroup.com.