Make Your Data Sing – Literally



By John Russell

June 27, 2008 | Human hearing is wonderfully discriminating. Most of us effortlessly notice when a musical instrument drifts out of tune or a piece of music breaks out of harmony. The ‘bad’ notes stick out. What’s more, the simple tones we hear are actually many overtones blended together – in this sense music acts as a kind of natural integrator of complex data.

Gil Alterovitz, a research fellow at Harvard/MIT Division of Health Science and Technology (in Children's Hospital Informatics Program) and a research affiliate, MIT Computer Science and Artificial Intelligence Laboratory, has been pioneering efforts to turn data – gene expression levels, for example – into musical applications that could be used in biomedical research, diagnostics, and for an endless variety of monitoring systems. To a significant extent the secret sauce is made of clever algorithms for data compression and transduction into notes.

You might, for example, construct a four-note chord, in which each of the notes is built from very many data points, weighted in various ways so that as data values increase or decrease, individual notes could sharpen, flatten, or become completely dissonant. The changing pitches, abrupt or gradual, could tell a lot about the state of underlying process being described, and in near real-time if data is captured quickly enough. Surgeons in an operating room could easily follow some set of parameters as they vary, while still performing surgery.

“If one thing goes off, that doesn’t mean that the alarm really should go off; and what happens many times now [in operating rooms] is many people are ignoring the alarms,” says Alterovitz. “With this method a number of things have to go wrong for you to really start noticing that there is a problem. And if a number of things are going wrong it gradually will sound inharmonious and capture this fuzzy nature of it.”

Sonic signatures can be useful in biomedical research, and Alterovitz has done so in a colon cancer study.

“It is impossible to keep track of all of the genes at any given time,” he says. “Each gene is one dimension. So, 10 thousand genes have 10 thousand dimensions. It is also possible to represent a group of genes in fewer dimensions by using mathematical compression. In the colon cancer case, I condensed 3,142 genes to four linear combinations.”

Here is Alterovitz’ explanation:  Each linear combination is a different dimension of those of 3,142 genes. Each linear combination is assigned a different note.

Note one = collection of all 3,142 genes in a particular dimension 1.

Note two = collection of all 3,142 genes in a particular dimension 2.

Note three = collection of all 3,142 genes in a particular dimension 3.

And so on, for a total of four different notes (representing four different dimensions of the same 3,142 genes).

“So, when playing three or more notes at the same time, we get chords. When played across time, we get music,” explains Alterovitz. “Using Pythagorean tuning mathematics, we assigned the normal to be harmonious. So, then, deviations (like disease) turn out to naturally sound inharmonious. Thus, when we listen to harmonious music, it means the person is healthy; when we listen to inharmonious music it could be a sign of a pathological problem.”

In the colon cancer case, the growing inharmonious sound represented changes in gene expression (up or down) over time. In a diagnostic or treatment setting, training your ear might catch cancerous changes more quickly than other pathology methods. Alterovitz envisions many diverse applications for this sonification of data 

“When someone goes into shock, there are all of these cytokines that get released in the blood stream. We look at some of these end products now, but if you could look at them much earlier [it would be helpful], or when someone has a stroke there are all these particles released that actually cause more damage than the actual stroke after a certain amount of time. If we could see ahead of time, because genes are kind of like the blueprint, their expression determines ahead of time what will happen. It will allow us to monitor earlier what is going on, and that way we can have a treatment plan before [the problem] actually occurs,” says Alterovitz.

Next Step: Commercialization
If you smell a new company emerging, you’re right. Alterovitz and his colleagues have been talking to venture capitalists in recent months about how best to commercialize the technology. Children’s Hospital is also looking at some applications. One question is how to handle the intellectual property rights.

“That was an interesting discussion. We have this possibility of applying for a patent within one year of a certain date. But when we were talking to the venture capitalists, they were saying that we may not want to do a patent, we may want to keep it as a trade secret, the actual code and algorithm details, the fine details of it, because once you apply for a patent, everything is out there and anyone can just write a program. Epecially if you use something very efficient like MATLAB, you could write it very quickly.”

Indeed, Alterovitz had praise for the widely used MATLAB from The MathWorks. He stressed that its ability to link easily to many databases “is very useful, particular when you need to collaborate with experts in music, who did not necessarily have expertise in database querying and management.”

So far there’s been no shortage of ideas for potential applications. Cockpits in airplanes, for example, might benefit from a “musical monitoring” system. He was contacted by someone wondering if this might be applied in submarine sonar systems, making them easier to interpret.

One point of discussion has been how to generate cash flow from the start – a seeming prerequisite for getting funding nowadays.

“At the beginning you don’t really have a product, necessarily, right? But there are some applications that may not be as medically inspiring, but will bring in a revenue stream,” says Alterovitz. “For example, I’ve had many people express an interest in getting music, related to either their DNA information or even if it’s not their own genetic information, just general gene-based music. There’s a DJ who we’re working with who wants to play it in the Boston dance clubs around here. There are companies that license this type of material. That is something that can kind of quickly bring in cash flow and also potentially some branding for [us].”

Rock on!

Click here to login and leave a comment.  

0 Comments

Add Comment

Text Only 2000 character limit

Page 1 of 1



White Papers & Special Reports

sgi whp 2
Managing the Modern Genomics Data Flood
Sponsored by SGI

Managing and storing the perfect storm of multi-disciplined data pouring from next generation sequencers and other omics instruments is a central challenge in life sciences. Discover in this paper how the SGI ArcFiniti storage solution, optimized for unstructured genomics and life sciences data can: 

  • Reduce costs, proactively protect data integrity, and deliver the high performance I/O required for genomics data processing and analysis.  
  • Effectively manage capacities from 156TB to 1.4PB as a disk based, integrated hardware and software platform 


sgi - whp 1
Turning Genomics Data into Practical Insight
Sponsored by SGI

With worldwide sequencing capacity approaching 13 quadrillion DNA bases annually turning genomics data into knowledge is a true computational challenge. Read this paper and learn how the SGI UV coherent shared memory platform can:  

  • Speed results time while cost competitively tackling the most difficult computational problems across all omics disciplines. 
  • Push performance by scaling to extraordinary levels, up to 256 sockets (2,560 cores, 4,096 threads) per single system (one OS image). 

Provide support for up to 16TB of coherent shared memory in a single system image enabling extreme efficiency across a wide range of compute demands. 



accerlys-logo_2012_wh
New Complimentary Market Survey…
Collaborations and Communications Within Drug Discovery Research
Sponsored by Accelrys
This survey was conducted by the Cambridge Healthtech Media Group in January, 2012. It was sponsored by Accelrys related to their HEOS initiative to gather valid information around externalizing collaborative research while improving communications in the cloud. With 310 qualified industry respondents the survey findings reveal useful usage and trends patterns.  An insightful follow-on discussion and webinar related to this survey, and the HEOS by Scynexis SaaS portal is also available on the Bio-IT World website for complementary viewing.
 


Job Openings

tessella logo 
Scientific Software Engineer
Boston MA
$70,000 to $95,000
 

Tessella delivers software engineering and consulting services to leading pharmaceutical and biotech companies. We are recruiting Software Engineersto work with skilled bioinformaticians and scientists to identify business needs and recommend and develop technical solutions. Applicants require BS, MS or PhD in bioinformatics, biology or chemistry and 2+ years of software development in either: Java, C#, C++, C or VB.NET. 

Apply at http://jobs.tessella.com   

 

oxford nanopore logo 


 Early Access Collaborations Managers
Oxford Nanopore Technologies is developing a novel technology, GridIONTM for the direct, electronic analysis of DNA/RNA and other analytes.  As the system approaches the market, we are building a team of technically knowledgeable, highly motivated candidates with excellent customer service and facilitation skills to join our company as Collaboration Managers.  This is a unique opportunity to work with world-leading genomics customers throughout the early adoption phase of a new generation of DNA sequencing technology.. This is a facilitative, enabling role with responsibility for managing technology development collaborations with key customers at leading genomics institutions.  It will include long term management of the collaboration plan and milestones and associated meetings and documentation. Click here to find out more and apply   

Oxford Nanopore's GridION technology, VP, Sales and Marketing Oxford Nanopore Technologies is a fast-moving technology company that is developing a novel electronic molecular analysis technology. The technology is adaptable for the analysis of DNA/RNA, proteins, chemicals and other molecules.  It is therefore suitable for use in a variety of markets including scientific research and clinical applications.  As the technology approaches the market, Oxford Nanopore is seeking a visionary VP of sales and marketing to join the senior team.  The candidate will embrace the opportunities afforded by entering the market with a truly disruptive technology that has the potential to expand the number of users and the variety of applications in each target market.  This is a rare opportunity to influence the commercial strategy at an early phase of its commercial lifetime, in a well funded company.  Oxford Nanopore welcomes applications from candidates with a track record of high-level strategic commercial  leadership, who wish to apply a fresh approach to existing markets.  Experience in Life Sciences/DNA sequencing is central to this role, however we will consider your application if you have experience of disruptive technologies in other related industries.  We are particularly interested in candidates with strong expertise in the use of digital technologies for sales and marketing of scientific/technical products.  Click to  Apply  


 

For reprints and/or copyright permission, please contact  Tim McLucas, (781) 972-1342, tmclucas@healthtech.com .