Cloud Storage Alternatives to Amazon and Google



COMMENTARY

By Sajai Krishnan

June 22, 2009 | Cloud computing is in the press a lot these days. What's all the fuss about? To seasoned IT administrators, this looks like the latest shiny new thing that IT vendors are pushing. You can’t blame users who roll their eyes and think, “Here they go again.”

But suspend your disbelief for a moment, there is something here. As the Economist noted in March, “something deeper is going on in the computer industry. Thanks to ever more powerful chips and new software, servers and other hardware can now be “virtualised”, meaning physically separate systems can act as one.”

This is true of compute clouds and storage clouds and something that could not have been done 3-4 years ago. I think of cloud computing as the 4th wave of computing—after mainframes, client-servers, and 3-Tier-Web. By the year 2015, cloud computing is going to be 20-30% of IT deployment, taking a spot alongside mainframes in the way that IT is deployed. 

Cloud computing is an overarching term. Simplistically, this tiers into three layers (see figure). Cloud applications, also known as Software-as-a-Service (SaaS), were well established before the industry called it ‘cloud’ anything. The cloud compute layer is where most of the vendor activity is today. This is because the server virtualization plays have done the heavy lifting already, allowing a proliferation of value-add plays.

ParaScale
Cloud storage is a relatively new concept that is becoming a more recognizable term among enterprises. Cloud compute and cloud storage are analogous in that both are bringing the benefits of commodity economics at very large scales in an easy-to-manage service model. And both can start really small, say with as few as 3-4 servers.

For many people, the term cloud storage conjures up an image of a giant public cloud storage service provider like Amazon S3 or the Google File System(GFS). However, there is more to cloud storage than massive globe-spanning storage clouds where one can rent storage capacity by the month. IT shops in the bio-pharma space or university research groups can easily deploy internal storage clouds in their environment, which can provide better performance than typical public cloud services like Amazon S3. These internal storage clouds can even be paired with the demanding compute environments that are often seen in these market sectors. Such clouds can start small, and scale as and when required.

Take, for example, the private cloud built by the team at the Stanford Genome Technology Center. This team had been struggling with ways to manage the rapidly increasing amounts of data generated by their genome sequencers. Each run of the sequencer analyzed human DNA and produced images varying in size from 100 gigabytes (GB) to 4 terabytes (TB) over a few days. These images were then fed into a high-performance compute (HPC) cluster for further analysis.

It is economically challenging to store these vast amounts of data on traditional storage solutions from the major vendors. Not only is it expensive, even managing the simplest mainstream SAN (storage area network) or NAS (network attached storage) solution requires a team of experienced IT staff. The Stanford team considered a storage cloud solution that enabled them to economically store data and easily scale based on their needs without management overhead. Without a dedicated IT staff, their solution needed to be simple enough to be managed by molecular biologists, not computer scientists.
 
Storage clouds directly deliver Moore’s Law economies for the vexing problem of managing exploding file storage costs in the bio-pharma space. Whether enterprises choose to build a petabyte scale cloud with hundreds of nodes for all their high-definition images or dip their toe in the water and build a small cloud is up to them. However, storage clouds within the enterprise are catching on; all the tools to build a private cloud are available online. In fact, with some understanding of Linux, deploying and managing storage clouds is even easier than deploying more traditional SAN or NAS solutions.


Sajai Krishnan is the CEO of ParaScale, a cloud storage start-up.
Click here to login and leave a comment.  

1 Comments

  • Avatar

    This is a great summary of cloud storage service vendors! But as always smaller companies that help to manage cloud storage are forgotten ;-) CloudBerry Explorer for Amazon S3 is an example of a small tool that makes managing cloud storage easier. With FTP like client it makes managing files in S3 EASY http://cloudberrylab.com/ It supports most of the Amazon S3 and CloudFront features and It is a freeware.

Add Comment

Text Only 2000 character limit

Page 1 of 1



White Papers & Special Reports

sgi whp 2
Managing the Modern Genomics Data Flood
Sponsored by SGI

Managing and storing the perfect storm of multi-disciplined data pouring from next generation sequencers and other omics instruments is a central challenge in life sciences. Discover in this paper how the SGI ArcFiniti storage solution, optimized for unstructured genomics and life sciences data can: 

  • Reduce costs, proactively protect data integrity, and deliver the high performance I/O required for genomics data processing and analysis.  
  • Effectively manage capacities from 156TB to 1.4PB as a disk based, integrated hardware and software platform 


sgi - whp 1
Turning Genomics Data into Practical Insight
Sponsored by SGI

With worldwide sequencing capacity approaching 13 quadrillion DNA bases annually turning genomics data into knowledge is a true computational challenge. Read this paper and learn how the SGI UV coherent shared memory platform can:  

  • Speed results time while cost competitively tackling the most difficult computational problems across all omics disciplines. 
  • Push performance by scaling to extraordinary levels, up to 256 sockets (2,560 cores, 4,096 threads) per single system (one OS image). 

Provide support for up to 16TB of coherent shared memory in a single system image enabling extreme efficiency across a wide range of compute demands. 



accerlys-logo_2012_wh
New Complimentary Market Survey…
Collaborations and Communications Within Drug Discovery Research
Sponsored by Accelrys
This survey was conducted by the Cambridge Healthtech Media Group in January, 2012. It was sponsored by Accelrys related to their HEOS initiative to gather valid information around externalizing collaborative research while improving communications in the cloud. With 310 qualified industry respondents the survey findings reveal useful usage and trends patterns.  An insightful follow-on discussion and webinar related to this survey, and the HEOS by Scynexis SaaS portal is also available on the Bio-IT World website for complementary viewing.
 


Job Openings

tessella logo 
Scientific Software Engineer
Boston MA
$70,000 to $95,000
 

Tessella delivers software engineering and consulting services to leading pharmaceutical and biotech companies. We are recruiting Software Engineersto work with skilled bioinformaticians and scientists to identify business needs and recommend and develop technical solutions. Applicants require BS, MS or PhD in bioinformatics, biology or chemistry and 2+ years of software development in either: Java, C#, C++, C or VB.NET. 

Apply at http://jobs.tessella.com   

 

oxford nanopore logo 


 Early Access Collaborations Managers
Oxford Nanopore Technologies is developing a novel technology, GridIONTM for the direct, electronic analysis of DNA/RNA and other analytes.  As the system approaches the market, we are building a team of technically knowledgeable, highly motivated candidates with excellent customer service and facilitation skills to join our company as Collaboration Managers.  This is a unique opportunity to work with world-leading genomics customers throughout the early adoption phase of a new generation of DNA sequencing technology.. This is a facilitative, enabling role with responsibility for managing technology development collaborations with key customers at leading genomics institutions.  It will include long term management of the collaboration plan and milestones and associated meetings and documentation. Click here to find out more and apply   

Oxford Nanopore's GridION technology, VP, Sales and Marketing Oxford Nanopore Technologies is a fast-moving technology company that is developing a novel electronic molecular analysis technology. The technology is adaptable for the analysis of DNA/RNA, proteins, chemicals and other molecules.  It is therefore suitable for use in a variety of markets including scientific research and clinical applications.  As the technology approaches the market, Oxford Nanopore is seeking a visionary VP of sales and marketing to join the senior team.  The candidate will embrace the opportunities afforded by entering the market with a truly disruptive technology that has the potential to expand the number of users and the variety of applications in each target market.  This is a rare opportunity to influence the commercial strategy at an early phase of its commercial lifetime, in a well funded company.  Oxford Nanopore welcomes applications from candidates with a track record of high-level strategic commercial  leadership, who wish to apply a fresh approach to existing markets.  Experience in Life Sciences/DNA sequencing is central to this role, however we will consider your application if you have experience of disruptive technologies in other related industries.  We are particularly interested in candidates with strong expertise in the use of digital technologies for sales and marketing of scientific/technical products.  Click to  Apply  


 

For reprints and/or copyright permission, please contact  Tim McLucas, (781) 972-1342, tmclucas@healthtech.com .