Lilly’s Grid Goes Open Source



The pharma's Discovery IT platform is freely available.

By Kevin Davies

July 14, 2008 | In an unusual move for big pharma, Eli Lilly has made its Discovery IT platform—known internally as the Lilly Science Grid (LSG)—open source. The initiative could spark new interest within the biopharma community for sharing pre-competitive content and software.

“This is something that John Reynders was interested in and has been in the works for a while,” says Susie Stephens, principal research scientist with Lilly. (Reynders left Lilly for Johnson & Johnson last year; see Reynders Takes the CIO Reins at J&J R&D, Bio-IT World, January 2008). The move addresses a key question: “Do [biopharma] all really need to manage very similar resources or are there areas where we can work together?”

LSG, which began in 2005, has a core framework to enable users to build plug-ins to create applications. “This is our core discovery IT framework, and was developed on the biology side, but has extended into chemistry internally,” explains Stephens. The current lead technologist is Andy Ring.

The software is available on Sourceforge.net. “It’s all well and good developing plug-ins internally for the framework, but if there are many people within the broader life sciences community doing that, it could make it much more interesting,” she says.

The publicly available version of LSG — dubbed Life Sciences Grid—includes a select group of “less proprietary” plug-ins, including those for Gene Browser, NCBI Entrez, and Gene Ontology. “We’re also kick-starting some collaborations, working with people in academia to develop plug-ins,” says Stephens. Indeed, she says at times Lilly’s open source ambitions are exceeding those of some potential academic partners. “We’d like to see many people building plug-ins, some of which we wouldn’t have thought to have done.”

But why go to the trouble? Says Stephens: “We’re looking to become much more networked as a company, so we’re looking hard at what we consider proprietary and what we must keep to ourselves, and what are we actually doing that other pharma companies are also doing that we consider to be pre-competitive.

“We don’t think it makes financial sense for all pharma and biotech companies to all be developing a core discovery IT framework... I’m not aware of a huge number of projects where pharma is making code available open source,” adds Stephens. She cites PISTOIA—a new initiative to share data models, and web-service interfaces in areas such as cheminformatics, which is also available through Sourceforge.

Stephens, who formerly was something of a Semantic Web guru at Oracle, is enjoying her matchmaking role at Lilly. She focuses chiefly on open innovation, identifying areas “where it makes sense for Lilly to establish external collaborations” and gaps in the company’s in-house capabilities or resources where it would make sense to find collaborators, in areas such as informatics and the Semantic Web. Another goal is to open a small “open innovation center,” probably in Cambridge, Mass., spun off from Lilly that would focus on pre-competitive collaborations across pharma, again in areas such as software and discovery IT.

LSG can be found at www.sourceforge.net (search “LSG”), under a Berkeley Software Development license.  

What is LSG?

The Life Science Grid is a software infrastructure that Lilly developed internally for drug discovery programs. LSG is a plug-in hosting and deployment framework that sits on top of Microsoft’s Composite Application Block. LSG is a rich client that requires .NET 2.0 or higher. The framework simplifies the task of creating new plug-ins by providing a Visual Studio template from which developers can quickly learn and expand. Users can choose which applications and plug-ins to use within an integrated environment. Within Lilly, Stephens says LSG is mainly used by bioinformaticians and computational biologists, but it has many additional capabilities, including target assessment. She stresses its value as a platform for interoperability. People interested in working with the open source version can take the code and develop enhancements and modifications; there are restrictions on further commercialization. LSG used to be dependent on Oracle databases, but now works equally well on MySQL.   --K.D.

 

 

___________________________________________________

This article appeared in Bio-IT World Magazine.

Subscriptions are free for qualifying individuals.  Apply Today.

 

 

Reynders Takes the CIO Reins at J&J R&D
Click here to login and leave a comment.  

0 Comments

Add Comment

Text Only 2000 character limit

Page 1 of 1



White Papers & Special Reports

sgi whp 2
Managing the Modern Genomics Data Flood
Sponsored by SGI

Managing and storing the perfect storm of multi-disciplined data pouring from next generation sequencers and other omics instruments is a central challenge in life sciences. Discover in this paper how the SGI ArcFiniti storage solution, optimized for unstructured genomics and life sciences data can: 

  • Reduce costs, proactively protect data integrity, and deliver the high performance I/O required for genomics data processing and analysis.  
  • Effectively manage capacities from 156TB to 1.4PB as a disk based, integrated hardware and software platform 


sgi - whp 1
Turning Genomics Data into Practical Insight
Sponsored by SGI

With worldwide sequencing capacity approaching 13 quadrillion DNA bases annually turning genomics data into knowledge is a true computational challenge. Read this paper and learn how the SGI UV coherent shared memory platform can:  

  • Speed results time while cost competitively tackling the most difficult computational problems across all omics disciplines. 
  • Push performance by scaling to extraordinary levels, up to 256 sockets (2,560 cores, 4,096 threads) per single system (one OS image). 

Provide support for up to 16TB of coherent shared memory in a single system image enabling extreme efficiency across a wide range of compute demands. 



accerlys-logo_2012_wh
New Complimentary Market Survey…
Collaborations and Communications Within Drug Discovery Research
Sponsored by Accelrys
This survey was conducted by the Cambridge Healthtech Media Group in January, 2012. It was sponsored by Accelrys related to their HEOS initiative to gather valid information around externalizing collaborative research while improving communications in the cloud. With 310 qualified industry respondents the survey findings reveal useful usage and trends patterns.  An insightful follow-on discussion and webinar related to this survey, and the HEOS by Scynexis SaaS portal is also available on the Bio-IT World website for complementary viewing.
 


Job Openings

tessella logo 
Scientific Software Engineer
Boston MA
$70,000 to $95,000
 
Apply at http://jobs.tessella.com   

oxford nanopore logo 


Early Access Collaborations ManagersClick here to find out more and apply   

Oxford Nanopore's GridION technology, VP, Sales and Marketing Click to  Apply  

For reprints and/or copyright permission, please contact  Tim McLucas, (781) 972-1342, tmclucas@healthtech.com .