Lilly’s Grid Goes Open Source


The pharma's Discovery IT platform is freely available.

By Kevin Davies

July 14, 2008 | In an unusual move for big pharma, Eli Lilly has made its Discovery IT platform—known internally as the Lilly Science Grid (LSG)—open source. The initiative could spark new interest within the biopharma community for sharing pre-competitive content and software.

“This is something that John Reynders was interested in and has been in the works for a while,” says Susie Stephens, principal research scientist with Lilly. (Reynders left Lilly for Johnson & Johnson last year; see Reynders Takes the CIO Reins at J&J R&D, Bio-IT World, January 2008). The move addresses a key question: “Do [biopharma] all really need to manage very similar resources or are there areas where we can work together?”

LSG, which began in 2005, has a core framework to enable users to build plug-ins to create applications. “This is our core discovery IT framework, and was developed on the biology side, but has extended into chemistry internally,” explains Stephens. The current lead technologist is Andy Ring.

The software is available on Sourceforge.net. “It’s all well and good developing plug-ins internally for the framework, but if there are many people within the broader life sciences community doing that, it could make it much more interesting,” she says.

The publicly available version of LSG — dubbed Life Sciences Grid—includes a select group of “less proprietary” plug-ins, including those for Gene Browser, NCBI Entrez, and Gene Ontology. “We’re also kick-starting some collaborations, working with people in academia to develop plug-ins,” says Stephens. Indeed, she says at times Lilly’s open source ambitions are exceeding those of some potential academic partners. “We’d like to see many people building plug-ins, some of which we wouldn’t have thought to have done.”

But why go to the trouble? Says Stephens: “We’re looking to become much more networked as a company, so we’re looking hard at what we consider proprietary and what we must keep to ourselves, and what are we actually doing that other pharma companies are also doing that we consider to be pre-competitive.

“We don’t think it makes financial sense for all pharma and biotech companies to all be developing a core discovery IT framework... I’m not aware of a huge number of projects where pharma is making code available open source,” adds Stephens. She cites PISTOIA—a new initiative to share data models, and web-service interfaces in areas such as cheminformatics, which is also available through Sourceforge.

Stephens, who formerly was something of a Semantic Web guru at Oracle, is enjoying her matchmaking role at Lilly. She focuses chiefly on open innovation, identifying areas “where it makes sense for Lilly to establish external collaborations” and gaps in the company’s in-house capabilities or resources where it would make sense to find collaborators, in areas such as informatics and the Semantic Web. Another goal is to open a small “open innovation center,” probably in Cambridge, Mass., spun off from Lilly that would focus on pre-competitive collaborations across pharma, again in areas such as software and discovery IT.

LSG can be found at www.sourceforge.net (search “LSG”), under a Berkeley Software Development license.  

What is LSG?

The Life Science Grid is a software infrastructure that Lilly developed internally for drug discovery programs. LSG is a plug-in hosting and deployment framework that sits on top of Microsoft’s Composite Application Block. LSG is a rich client that requires .NET 2.0 or higher. The framework simplifies the task of creating new plug-ins by providing a Visual Studio template from which developers can quickly learn and expand. Users can choose which applications and plug-ins to use within an integrated environment. Within Lilly, Stephens says LSG is mainly used by bioinformaticians and computational biologists, but it has many additional capabilities, including target assessment. She stresses its value as a platform for interoperability. People interested in working with the open source version can take the code and develop enhancements and modifications; there are restrictions on further commercialization. LSG used to be dependent on Oracle databases, but now works equally well on MySQL.   --K.D.

 

 

___________________________________________________

This article appeared in Bio-IT World Magazine.

Subscriptions are free for qualifying individuals.  Apply Today.

 

 

Reynders Takes the CIO Reins at J&J R&D
Click here to login and leave a comment.  

0 Comments

Add Comment

Text Only 2000 character limit

Page 1 of 1

White Papers & Special Reports

isilon white paper

“Storage for Science – Methods for Managing Large and Rapidly Growing Data Stores in Life Science Research Environments” sponsored by Isilon
Large and rapidly growing stores of file-based and other data are a hallmark of life science research and bioinformatics. Determining how best to manage those data stores has become a significant challenge for Researchers and IT Pros alike.

This paper is intended to:

  • Provide guidance on the many storage requirements common to Life Science research;
  • Explain the evolution of modern storage architectures;
  • Summarize the major data storage architectures currently in use.

Additionally, it will present the Isilon IQ clustered storage product as a strong and flexible solution to those needs. Download now



definiens briefingon-76Next-Generation Technologies Revolutionizing Oncology and Diagnostics
underwritten by Definiens

This “Briefing On” collection of Bio-IT World features, commentaries and analysis, presents some of the latest thinking on high-throughput technologies that are being applied to the fields of research and drug discovery, with particular emphasis on oncology, diagnostics and imaging technologies. Download now at no charge compliments of the underwriting sponsor, Definiens. Download This Free Paper



metaminer image(1)

MetaMiner™ Cystic Fibrosis Report,  Sponsored by GeneGo
This paper discusses the MetaMiner™ (CF) data analysis platform for a broad range of CF researchers designed to: 1. Easily assemble important biological and chemical experimental data available today in cystic fibrosis research. 2. Visualize key mechanisms leading to the disease through pathway maps and network models 3. Provide the CF community a “one stop shop” tool for uploading and analyzing experimental data in a disease-centered interface.  Download now 



Life Science Webcasts & Podcasts

Storage for Science
Methods for Managing Large and Rapidly Growing Data Stores in Life Science Research Environments

Sponsored by Isilon

Isilon webcast1

Large and rapidly growing stores of file-based and other data are a hallmark of life science research and bioinformatics environments. Determining how best to manage those data stores has become a significant challenge for the Researchers and IT Professionals that support them.

This webcast is intended to: 

  • Provide guidance on the many storage requirements common to Life Science research; 
  • Explain the evolution of modern data storage architectures; 
  • Summarize the major data storage architectures currently in use;
  • Present the Isilon IQ clustered storage product as a strong and flexible solution to those needs.

    Download this webcast

More Podcasts

Job Openings

Isilon Systems ~ Senior Marketing Communications Manager
Isilon Systems is the worldwide leader in clustered storage systems and software for digital content and unstructured data. We seek an experienced marketing communications professional/writer expert in creating and delivering effective and persuasive business communications. The ideal candidate can think at the strategic and conceptual level and act, simultaneously, as a highly-effective and productive individual contributor. The position is based in Seattle, WA. For additional information click here:
 

Lilly Singapore Center for Drug Discovery (LSCDD) - Associate Director of Informatics
Lead and mentor a strong team for the Bioinformatics group at the Integrative Computational Sciences (ICS) department at LSCDD towards the development of novel algorithms, data analysis methods and software tools for drug discovery. Work closely with the Software Engineering group at ICS, and collaborate with the Discovery IT organization in Europe and USA. For additional information, or to apply visit: LSCDD 

For reprints and/or copyright permission, please contact RMS, 1808 Colonial Village Lane, Lancaster, PA;

(717) 399-1900 ext. 125 or via email to bio-itworld@theygsgroup.com.