Integrating HP Data Protector software with
HP Data Deduplication Solutions
An analysis of how to implement data deduplication technologies
utilizing HP Data Protector software
Executive summary
Solution description
What is data deduplication?
The benefits of data deduplication
Details to know about object-level differencing
How much space does data deduplication really save?
Example: 500 GB file server backup
Deduplication portfolio strategy from HP
Where does HP Data Protector software fit into the picture?
HP Data Protector Advanced Backup to Disk Licensing
Verify capacity for VTL
Licensing example
VTL configuration
For more information
Executive summary
This white paper provides complementary information on data deduplication technologies supported by the
latest Storage Solutions from HP. Data deduplication is a hot topic in data protection and is therefore
also a relevant topic for HP Data Protector software.
Solution description
HP Data Protector software is a backup and disaster recovery product that provides reliable data protection
and high availability for your expanding mission-critical data. Its network component concept supports
tailor-made backup and recovery solutions ranging from a single system to thousands of systems across
multiple sites. HP Data Protector software fully supports HP data deduplication technologies, allowing you
to recover files more quickly while reducing your data management and storage costs. Data deduplication
can increase your storage efficiency by a ratio of up to 50:1 (up to 5000%). The extra capacity allows you
to keep more backup data online and ready to restore at a moment’s notice. Overall, the increase in
storage efficiency brought about by deduplication lets you do more for less.
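To make the 50:1 figure concrete, the following minimal sketch converts a deduplication ratio into the physical capacity consumed and the percentage of space saved. The function name and the 10,000 GB backup volume are hypothetical values chosen for illustration, and 50:1 is the best-case ratio quoted above, not a measured result.

```python
def dedup_savings(logical_gb, ratio):
    """Translate a deduplication ratio into capacity figures.

    logical_gb: amount of backup data written (hypothetical input).
    ratio:      deduplication ratio, e.g. 50 for 50:1.
    """
    if ratio <= 0:
        raise ValueError("ratio must be positive")
    physical_gb = logical_gb / ratio      # capacity actually consumed on disk
    saved_pct = 100.0 - 100.0 / ratio     # share of capacity not consumed
    capacity_pct = ratio * 100.0          # effective capacity relative to raw disk
    return physical_gb, saved_pct, capacity_pct


# Hypothetical example: 10,000 GB of backups at the best-case 50:1 ratio.
physical, saved, capacity = dedup_savings(logical_gb=10_000, ratio=50)
print(f"{physical:.0f} GB on disk, {saved:.0f}% space saved, "
      f"{capacity:.0f}% effective capacity")
# prints: 200 GB on disk, 98% space saved, 5000% effective capacity
```

Read this way, the 5000% figure describes effective capacity (50 times the raw disk space), while the space saved at the same ratio is 98%.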
What is data deduplication?
Data deduplication is the ability of an appliance or software to compare blocks of data being written to the
backup device with data blocks previously stored on the device. If duplicate data is found, a pointer is
established to the original data, rather than storing the duplicate data sets. This removes, or
“deduplicates,” the redundant data blocks. Data deduplication is done at the block or chunk level, not at
the file level.
This greatly reduces the volume of data stored.
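The compare-and-point mechanism described above can be sketched in a few lines. This is an illustration only, not how any HP product is implemented: the fixed 4096-byte chunk size, the use of SHA-256 as the block fingerprint, and the class name DedupStore are all assumptions made for the example.

```python
import hashlib
from typing import Dict, List

CHUNK_SIZE = 4096  # assumed fixed chunk size, chosen only for illustration


class DedupStore:
    """Toy block-level deduplicating store (hypothetical, in-memory)."""

    def __init__(self, chunk_size: int = CHUNK_SIZE) -> None:
        self.chunk_size = chunk_size
        self.chunks: Dict[str, bytes] = {}  # fingerprint -> unique chunk

    def write(self, data: bytes) -> List[str]:
        """Store data and return the pointers needed to rebuild it."""
        pointers = []
        for offset in range(0, len(data), self.chunk_size):
            chunk = data[offset:offset + self.chunk_size]
            digest = hashlib.sha256(chunk).hexdigest()
            # Only chunks not seen before consume new space;
            # duplicates are recorded as a pointer to the stored chunk.
            if digest not in self.chunks:
                self.chunks[digest] = chunk
            pointers.append(digest)
        return pointers

    def read(self, pointers: List[str]) -> bytes:
        """Rebuild the original data by following the pointers."""
        return b"".join(self.chunks[p] for p in pointers)

    def stored_bytes(self) -> int:
        return sum(len(c) for c in self.chunks.values())


# Two backups that differ only in their last chunk.
monday = b"A" * 4096 + b"B" * 4096 + b"C" * 4096
tuesday = b"A" * 4096 + b"B" * 4096 + b"D" * 4096

store = DedupStore()
monday_ptrs = store.write(monday)
tuesday_ptrs = store.write(tuesday)

written = len(monday) + len(tuesday)
print(written, store.stored_bytes())  # prints: 24576 16384
```

In this sketch the second backup adds only one new chunk, because its first two chunks already exist in the store and are referenced by pointer.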
Data deduplication is often used in conjunction with other forms of data reduction, such as conventional
data compression, to further reduce the data volume stored.
The best approach to data deduplication depends on the size of your organization and your backup needs.
• Deduplication for enterprises: Object-level differencing, or accelerated deduplication, is a
good choice for enterprise customers because it focuses on performance and scalability. It delivers
the fastest restores, as well as the fastest possible backup by deduplicating data after it has been
written to disk. You can scale up to increase performance simply by adding extra nodes.
• Deduplication for midsize businesses and remote enterprise sites: Hash-based
chunking, or dynamic deduplication, is a good choice for small and midsize businesses or large
enterprises with remote sites because it focuses on compatibility and cost. It delivers a low-cost,
small-footprint, format-independent solution.
A detailed description of deduplication techniques can be found in the HP white paper
“Understanding the HP Data Deduplication Strategy” at:
http://h71028.www7.hp.com/ERC/downloads/4AA1-9796ENW.pdf
Figure 1 shows the basic deduplication concept.
Figure 1: Deduplication Concept