Download the authoritative guide: Enterprise Data Storage 2018: Optimizing Your Storage Infrastructure
Veteran storage managers who have been successfully protecting data for years, if not decades, sometimes get tripped up by the peculiarities of virtual environments. According to Kroll Ontrack, 40% of enterprises lose information every year from virtual environments, and about two thirds find that they are not able to recover all their virtual data in the event of a disaster.
This goes against the grain of popular perception. Most believe that storing data in virtual environments decreases the risk of data loss. Here are some tips on how to better protect, backup and recover data in virtual environments:
1) Understand the Difference in Environment
Those more used to physical data protection should first come to a firm understanding of how the virtual environment is different. Because of higher consolidation of servers in virtual environments, data loss due to a single server failure may be much greater.
“A hardware failure for a physical server would result in one server going down, while in virtual environments, hardware failure of a host would result in all virtual machines (VMs) going down and the data stored within them, on the host, being lost,” said Sergey Kandaurov, Director of Product Management at Acronis.
2) Protect the Hypervisor
Further, virtualization adds one more layer that needs to be protected – the hypervisor. Downtime in a virtualized environment may be caused not only by server failure, but also by problems with the virtualization infrastructure, such as hypervisor failure. Thus, the hypervisor has to be safeguarded, too.
“Products that are not specifically designed for the hypervisor you’re using won’t work efficiently, and might even do harm,” said Kandaurov. “Ensure that the backup solution you choose is capable of backing up a hypervisor with its configuration and allows easy and fast recovery.”
3) Don’t Rely on the Virtual Platform
Today’s virtual platforms come with all sorts of bells and whistles. This includes their own data protection and recovery mechanisms. But users would be wise to not only rely on them.
“Don’t think that clustering and high availability mechanisms provided by virtualization platforms like vSphere and Hyper-V reliably protect you from data loss,” said Kandaurov. “They help to minimize downtime in some cases, but they’re not replacements for a proper backup strategy.”
4) Monitor the Backup Performance Hit
In the physical world, you usually have one application per server, and that server is likely not running anywhere near capacity. In the virtual world, there generally are way more than that, and the host they're all running on is much closer to capacity. This can exert a serious impact on performance.
“That excess capacity can be used to run the backup without impacting the performance of the application that much, but if the excess capacity does not exist, then that's an issue,” said Eric Burgener, an analyst at IDC.
“Our data today shows that on average each host has about 10 Virtual Machines (VMs), and that is going to a little more than 12 by 2017. If you try to back all of them up at once, you're going to impact performance on probably all of them, so you have to come up with a way to still perform the backups in a timely manner without unduly impacting performance.”
5) Non-Sequential Backups
That’s why the industry is moving away from sequential, file-based backups in virtual computing because they are too demanding in terms of CPU and network resources during the backup window and they take too long. Users are looking at other approaches that minimize the opportunity to impact the performance of production applications. A popular one is off-host snapshot backups, added Burgener. With this approach, you create a snapshot of each VM then mount each snapshot on a backup appliance or server, and back up the snapshot.
“While this can still have a bit of an impact, it is far less than backing up straight from a server,” said Burgener
6) Find the Right API
There are Application Programming Interfaces (APIs) that administrators can use to create snapshot backups - VMware offers VMware APIs for Data Protection (VADP), Microsoft offers Windows Volume Shadowcopy Services (VSS), Oracle offers Recovery Manager (RMAN), for example. All major backup products (appliances and software) can take advantage of these APIs and use them to create snapshots they can back up off-host.