ITPRO

Printed from www.itpro.co.uk

Register to receive our regular email newsletter at http://www.itpro.co.uk/registration.

The newsletter contains links to our latest IT news, product reviews, features and how-to guides, plus special offers and competitions.

Skip to navigation

    Genome project reduces storage requirements

The Durham-based research centre has reduced its VMware-based genomic information storage requirements by 83 per cent with data deduplication technology.

By Miya Knights, 9 Oct 2008 at 15:15

Genome study

The Duke Institute for Genome Sciences and Policy (IGSP) has revealed it has achieved a dramatic reduction in the virtualised storage capacity of its ever-growing genomic research information.

The Durham-based institute is dedicated to the study of life through scientific advancement in genetics and genomics. It requires substantial amounts of data storage for the creation and analysis of genomic information.

Over a two-year period, this saw its data storage infrastructure requirements rocket from 4TB to 300TB, made up of large Oracle databases on dedicated servers and locally attached storage.

Rob Wagner, Duke IGSP database administrator said: "We just bought more disks and kept giving them to Oracle. Over time this proved to be very difficult to manage. Server sprawl and inefficient data storage use became a constant headache."

In order to solve these issues, the IGSP IT team implemented a server virtualisation project to consolidate 40 physical servers down to just three running the Oracle applications. At the same time, it also migrated and consolidated its locally attached storage to a networked fabric attached storage (FAS) system from NetApp.

The project enabled the institute to use data deduplication techniques to identify and eliminate redundant data across applications, as the technology is a key component of the NetApp core operating architecture, Data ONTAP.

Alan Cowles, Duke IGSP system administrator explained: "Previously, I spent more time managing 60TB to 70TB of direct-attached storage than I do managing the 225TB of NetApp storage we have now.“

The current IGSP environment consists of a 40-node high-compute server farm, three VMware ESX servers and 50 mixed-use application servers, where all data is provisioned from a NetApp FAS3070 clustered storage system.

Cowles said deduplication now plays a major role in managing data growth. “When we originally set up VMware, I allocated about 2.4TB for it. With NetApp deduplication, I've been able to shorten that down to less than 700GB. We now see an average of 83 per cent reduction in redundant data on our VMware systems."

Email to a friend

Print this page

Social Bookmark this article: What is this?

Be the first to comment on this article

You need to Login or Register to comment.

advertisement
advertisement

    Latest News Videos in Storage

Video: Steve Murphy, Hitachi Data Systems

Play Video: Steve Murphy, Hitachi Data Systems   Play

IT PRO speaks to Steve Murphy, UK Managing Director of storage technology specialist Hitachi Data Systems.

 

    White papers

Want more background on today's hottest IT trends?

Visit IT PRO's white paper library for more on virtualisation, encryption and other topics.

    Register for IT PRO

You'll get exclusive member benefits including free white papers, downloads, Webinars and weekly newsletters full of the latest IT PRO news, reviews, insight and expertise.

Advertisement