TriPrint Method Watermarks DNA


Malcolm Simons proposes codon changes to identify DNA.

By Graeme O’Neill

July 14, 2008 | MELBOURNE — In the fall of 2001, five people died in the anthrax attacks in New York and Boca Raton, Florida. Although the crime remains unsolved, the Bacillus anthracis strain used in the attacks was virtually identical to one held at the U.S. Army Medical Research Institute for Infectious Diseases (USAMRIID) in Fort Detrick, Maryland. The episode highlights the need for a means of readily distinguishing modified strains of bacteria, viruses, cell lines, and transgenic plants and animals, from naturally occurring forms.

Australian immunologist Malcolm Simons—who made headlines a few years ago with his controversial “junk DNA” patent (see, “Simons Says”)—is not the first to propose a method for “watermarking” proprietary DNA, but his idea is simple… and waterproof.

An obvious way to watermark a gene or genome is to insert a distinctive, non-coding DNA sequence. J. Craig Venter recently watermarked the name of his research institute into the genome of the prototype of the world’s first synthetic living organism, a genetically minimal Mycloplasma. His group inserted a synthetic sequence of triplet codons to spell out “VenterInstitvte”, along with his own name and the names of several colleagues, using the single-letter abbreviations of the 20 amino acids. (The amino acid “alphabet” lacks six of the 26 letters in the English alphabet, hence Venter’s substitution of a valine (V) in “Institvte”.)

“The issue is conserved sequence,” says Simons. “Everyone understands that it is only possible to hide cryptic information in conserved sequences. There are ultra-conserved sequences in non-coding DNA, but within the protein-coding region of a gene, any change in a single nucleotide may change the function of the protein.”

Simons’ solution, called TriPrint, exploits the variation available in synonymous codons within protein-coding exons to “imprint” the watermark within the gene, without changing the encoded protein. The four-base DNA code yields 64 unique triplet codons, which specify 20 amino acids. Each amino acid has between two and six alternatives, and even the universal “stop” codon that terminates genes has three variants: UAA, UAG and UGA. TriPrint would combine synonymous variants of the stop codon, and serine codons in each protein-coding exon, to create unique watermarks. (Altering just the first or second nucleotide in the triplet provides five alternatives to any wild-type serine codon).

Patented System
“The stop codon is least the mutable codon, and the least mutable amino acid of the 20 has got to be serine,” he said. “You would have to change two nucleotides in a transversion manner and the probability of simultaneous substitutions in the same serine is extremely remote.”

He said that a watermark pattern comprising just the first occurrence of serine in each exon of the gene, plus the stop codon, would provide many more unique permutations than required to distinguish between all naturally occurring variants of the gene. Indeed, even in a small gene with just a handful of exons, the number of potential permutations rapidly approaches Avogadro’s number.

“In practice, it might be sufficient just to modify the first serine codon in each exon of the gene, plus the stop codon,” Simons said. “The particular combination of serine variant and the stop codon could even serve to identify the company that created the gene.”

Simons’ TriPrint patent, filed with the Australian Patent Office on April 17, relates specifically to transversion mutations—changes that substitute a purine (A or G) for a pyrimidine (T or C) at any of the three codon positions. “The improbability of a dual mutation in the first two nucleotides can be regarded as infinitesimally small,” he said. “There is no need for an error-checking mechanism.”

Simons proposes that DNA watermarks would be recorded in a central database. “Given the concerns of some consumers about ‘contamination’ of conventional crops by pollen from genetically modified crops, TriPrint provides a way of individually watermarking every transgenic variety,” he said.

 A DNA watermark in the presumptive Fort Detrick strain of B. anthracis might not have prevented the anthrax attack, but Simons said the ability to rapidly trace a transgenic microbe to its source might give terrorists pause for thought. 

Simons Says

Malcolm Simons invented a controversial patent on the use of conserved DNA markers in non-coding DNA to identify haplotypes associated with inherited disorders (see, “Malcolm in the Middle,” Bio•IT World, August 2003). Simons asserts that the field of genome-wide association mapping was founded on his discovery that non-coding SNPs occur as haplotypes that enable gene mapping without pedigrees. In 2000, Simons quit Genetic Technologies, the Australian biotechnology company he co-founded, and relinquished his interest in both patents. He subsequently co-founded Haplomic Technologies to develop techniques to isolate and characterize the separate haploid DNA contributions of parents to their offspring, a prerequisite to understanding how the two “haplomes” interact to create phenotypically unique offspring. --G.O.

_________________________________________________

This article appeared in Bio-IT World Magazine.

Subscriptions are free for qualifying individuals.  Apply Today.

 

 

Click here to login and leave a comment.  

0 Comments

Add Comment

Text Only 2000 character limit

Page 1 of 1

White Papers & Special Reports

waters integrated
Integrated Software Approach to Streamline Method Development Workflow
Sponsored by Waters
Traditional chromatography method development evaluates numerous chromatographic variables that may include selection of the column, solvent, flow rate, etc. This application note demonstrates an integrated approach to capturing and storing separation related reports during chromatography method development.


HP white paper image
Extreme Storage Knowledge Center
Sponsored by HP

Visit HP’s Extreme Storage Knowledge Center to find informative, complimentary white papers, case studies, videos, product information and more.  Brief overview of topics:

  • The challenges of unstructured storage and how to manage both cost-effectively and efficiently
  • Company case studies of data storage challenges that translate across pharmaceutical and biotech companies today
  • Systems that manage vast amounts of data with simple deployment, unified management, and extreme scalability at an exceptionally low price per terabyte
  • Life sciences data management; viable solutions for small and large companies to manage growing storage demands
  • Take our virtual product tour and see our storage unit from inside out


Coupa white paper 92
10 Secrets to Recession-Proof Your Business
Sponsored by Coupa


Read this white paper to discover 10 strategies smart companies deploy to recession-proof their business.
Leaders generally face hard choices on how to mange a company during an economic downturn and
behave in one of three ways:
1) “The ostrich” - Preserve the status quo/hope for the best
2) “The bull in the china shop” - Blindly cut expenses across the board
3) “The fox” - Use the downturn to make your business more effective and position it for future growth

Learn how to behave “like a fox” and use a recession as a means to pounce on emerging trends.



Life Science Webcasts & Podcasts

Medidata Solutions

Rising Clinical Trial Delays and Costs - Addressing the Cause, Not the Symptoms 

medidata podcastProtocol complexity is taking a toll on clinical study speed and efficiency: increasingly complicated and ambitious protocols are not only burdening sites and study volunteers but are also prolonging trials and increasing expenses. In response, sponsors have turned to global study placement, restructured site relationships and new site management practices, but the problem remains.

This podcast will discuss:

  • Why these responses address only the symptoms, not the underlying cause, of rising clinical trial delays and costs.
  • Results of a recent joint Tufts University / Medidata Solutions study.
  • New metrics benchmarking protocol design trends.
  • Systematic protocol design improvements and why they are essential to clinical trial performance excellence.

Speakers: Ken Getz, Senior Research Fellow at the Tufts Center for the Study of Drug Development, and Ed Seguine, General Manager, Trial Planning Solutions at Medidata.

Download Now 



More Podcasts

Job Openings

Manager, Scientific Computing & Programming
Lead SAIC-Frederick, Inc.’s Bioinformatics & Analysis Group in developing & maintaining informatics pipelines for generation/analysis of dense genotyping & next-generation sequencing data. Required:  MS or equiv.  5 yrs related experience.  Knowledge of programming/software development, high performance computing, bioinformatics, project management. Visit www.saic-frederick.com - #130019.

For reprints and/or copyright permission, please contact The YGS Group, 1808 Colonial Village Lane, Lancaster, PA;

(717) 399-1900 ext. 125, or via email to Ashley.Zander@theYGSgroup.com.