What is de-duplication? Given the current rate of data growth and the way data is stored today, every data center faces the same challenge: redundant data existing across multiple platforms and locations within the organization. For example, think about the Q2 spreadsheet sent out by "Bob" from Accounting. It was approximately 5 MB in size. That same spreadsheet was received by over 100 employees. Each employee potentially saved it to their personal network drive for future reference. This set of data now exists in over 100 places, consuming 500 MB of space (half a GB). Then the network drives are backed up. Not just once, but multiple times: at a minimum, 4 times a month (about 2 GB of consumed storage space, usually across disk and tape). That assumes no one made any changes; if any of the files changed, the updated files would have to be backed up again. Therefore, one month after "Bob" sent out this very helpful spreadsheet, the organization's IT department is managing an extra 2 GB of data. This is just one file.
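To make the arithmetic in the spreadsheet example concrete, here is a minimal sketch in Python. The function name and the constants are illustrative only; the figures (5 MB file, 100 saved copies, 4 backup passes) are taken from the example above, and the "one stored instance" figure assumes every copy is byte-for-byte identical.

```python
# Illustrative figures from the spreadsheet example (names are hypothetical).
FILE_SIZE_MB = 5      # size of the spreadsheet
COPIES = 100          # employees who saved their own copy
BACKUP_PASSES = 4     # backup passes over the network drives in the period

def footprint_mb(file_size_mb, copies, backup_passes):
    """Return (primary_mb, backup_mb) consumed without de-duplication."""
    primary = file_size_mb * copies      # copies sitting on network drives
    backups = primary * backup_passes    # every pass copies all of them again
    return primary, backups

primary_mb, backup_mb = footprint_mb(FILE_SIZE_MB, COPIES, BACKUP_PASSES)

# Assumption: all copies are identical, so an ideal de-duplicating
# system would need to keep only one instance of the file.
deduped_mb = FILE_SIZE_MB

print(f"Primary copies: {primary_mb} MB")
print(f"Backup copies:  {backup_mb} MB")
print(f"One instance:   {deduped_mb} MB")
```

The point of the sketch is the multiplier: the footprint grows with both the number of saved copies and the number of backup passes, while the unique data stays at 5 MB.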
The Value? It's easy to see why this is an important topic for many IT leaders today. The industry term for addressing it is de-duplication, which falls into two subcategories: 1) next-generation backup and 2) single-instance storage (object-based storage).
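As a rough illustration of the single-instance idea, the sketch below stores each unique piece of content once, keyed by a SHA-256 hash of its bytes, and lets many file names point at it. This is a generic, simplified example, not a description of how any particular product works; the class name, method names, and paths are all hypothetical, and the payload is a small stand-in for the spreadsheet.

```python
import hashlib

class SingleInstanceStore:
    """Toy content-addressed store: identical content is kept only once."""

    def __init__(self):
        self._blobs = {}   # content hash -> bytes (stored once)
        self._names = {}   # logical path -> content hash

    def put(self, path, data):
        digest = hashlib.sha256(data).hexdigest()
        self._blobs.setdefault(digest, data)   # keep only the first copy
        self._names[path] = digest
        return digest

    def get(self, path):
        return self._blobs[self._names[path]]

    def logical_bytes(self):
        """Bytes users believe they have stored."""
        return sum(len(self._blobs[d]) for d in self._names.values())

    def stored_bytes(self):
        """Bytes actually kept after de-duplication."""
        return sum(len(b) for b in self._blobs.values())

# 100 users save the same file (5 KB stand-in for the 5 MB spreadsheet).
payload = b"Q2" * (5 * 1024 // 2)
store = SingleInstanceStore()
for user in range(100):
    store.put(f"/home/user{user}/q2.xls", payload)

print(store.logical_bytes(), store.stored_bytes())
```

In this toy version the unit of de-duplication is the whole file; a one-byte change to any copy would produce a new hash and a second stored instance.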
When it comes to de-duplication of data, IT leaders must consider two critical things.
#1> Does the manufacturer I am working with offer all types of de-duplication methods? Or will they force me to use their particular feature instead of the method that is appropriate for my data center?
#2> Can my manufacturer offer all three industry-standard methods of de-duplication? Almost every manufacturer offers only one type of de-duplication. Always ask this question!
I am sure you have seen that data de-duplication is one of the hottest topics in IT today. As one of your strategic IT partners, I am happy to inform you that EMC has the widest product portfolio in the de-duplication market. We have been leading the charge on this technology for many years. As the need for better and more cost-effective backup and recovery solutions grows, so does the EMC portfolio of solutions that address these challenges. If you are an Illinois- or Indiana-based company, my job as your dedicated EMC Account Executive is to let you know exactly which best-of-breed solutions are available to you. At EMC we are addressing the need for data de-duplication in the following three ways:
Three Data De-Duplication Methods
Source-Based (EMC Avamar)
- VMware environments
- Remote Office Backup
- Large file systems
Source-Based Data Sheet
Target-Based (EMC EDL 3D)
- Database environments
- Unique data
- Leverage existing backup software and processes
Target-Based Data Sheet
Object-Based (EMC Centera)
- Stale and static content
- Purpose-built archive
- Compliance and regulatory needs
Object-Based Data Sheet
Please call me with any questions or requests for information. I will follow up ASAP to discuss in more detail or provide more information. Thank you.
Regards,
Steve