Last Post
Last few years saw explosive growth in server virtualization market. Indeed, Server Virtualization has a number of benefits, including:
However, virtualization also creates new challenges, including how to backup virtual machines. Backing up a virtual machine may use a lot of storage and network resources. When a host machine hosts multiple virtual machines, backing up all virtual machines can become more challenging. If all virtual machines are being backed up at the same time, it may dramatically slow down the server performance.
DriveHQ’s Cloud Backup service offered as part of the Cloud IT solution has successfully backed data for tens of thousands of businesses. Our efficient and reliable backup software has also backed up many VMs for large enterprise customers. This article will talk more about the details of virtual machine backup methods, strategies, advantages and disadvantages, and why DriveHQ’s cloud backup solution is far better than other backup solutions.
As with any backup implementation, you need to decide what data needs to be backed up. Traditionally, there are Image Backup and Data Backup.
It backs up the entire system image (or disk image). With this approach, the main advantage is if the operating system failed and cannot boot, the backup software can restore the entire system including user data. The disadvantages are Image Backup often takes much longer, requires much more storage space to keep the backup copies, and it is hard to restore individual files. Moreover, image backup usually can only be restored to the same computer with the same hardware configuration.
It usually does not backup the operating system and program files. Instead it backs up user data files. With this approach, if the operating system fails, the user can use a system recovery disk to restore the operating system, then reinstall other programs, then use the backup software to restore all user data. The advantages of data backup are it usually takes much less time, requires much less storage space and it is much easier to restore individual files and folders.
This probably is the simplest method. If you back up the entire host server, then all guest VMs are also backed up. Since the virtual disk files are constantly being modified by the guest OS, the backup software needs to take a snapshot of the virtual disk files. With Volume Shadow Copy technology, this is not a problem.
The real disadvantages are very similar to what mentioned in Section 1.1. Because one host server can host many VMs, some of the disadvantages become even harder to cope with.
Compared with method (1), this method does not need to backup the host server itself. If there is only one VM hosted by the host server, the backup speed can be up to 100% faster. If the server hosts many VMs, the speed gain is less significant.
If the host server is used solely for hosting the guest VMs, then backing up the host server itself is not important. Instead of backing up the host server using Image Backup, the host server can be easily restored using the recovery disk without any backup.
Another option is to create an Image Backup to only backup the host server without the guest VMs. Because the files on the host server does not really affect the real data, it only needs to be backed up one-time or at a much lower frequency (e.g. once per year or once per month).
In one aspect, a virtual machine is just like a physical machine. So it can also be backed up using regular Image Backup. As the virtual machine’s disk image files are actually the VHD files, there is little difference between Method (3) and method (2). Of course if the Virtualization software does not support VSS, then Method (3) needs to be used. Luckily, all mainstream virtualization solution supports VSS.
Method (2) works better from manageability point of view: instead of creating backup tasks in each guest VM, the backup can be performed centrally from the host server.
Method (3) works better from portability point of view. A virtual machine might be moved from one host server to another host server. Nothing needs to be changed if Method (3) is used and the backup destination is a network mapped drive.
Two of the main advantages of virtualization are “fast provisioning” and “portability”. With these two advantages, Image Backup is generally not needed:
First of all, the host server is relegated to a bare-bone standard configuration with just the operating system and the virtualization software. Instead of restoring the host server, it can be easily restored using the recovery disk without backup. No user data or business data will be lost. Secondly, the guest OS can also be easily restored without backup. In fact, a pre-provisioned guest OS loaded with company-specific software can be running (or backed up) as a standby VM. In case the main guest VM fails, instead of restoring the entire guest VM, you only need to boot the standby VM and restore the user data to the standby VM. This can dramatically reduce the system down-time.
With the above analysis, given a little bit of preparation, Image Backup is generally not needed to backup your Host Servers and Virtual Machines – you might still need image backup, but usually it is just one-time backup, or recurring backup at a very low frequency.
As mentioned above, Data Backup is far more efficient and flexible than Image Backup.
Backing up virtual machines to the cloud poses another set of challenges. Using Image Backup, it could create very huge backup files – a single file can be as large as 200GB or even larger. Even if you only backup VHD files, the VHD files can also be very large – it can be as large as 100GB or even larger. When a single file gets so large, it becomes hard to upload, download and manipulate. Eventually, it will be too inefficient or unreliable to backup such files online.
To mitigate the above problem, incremental backup can be used. However, incremental backup must be extremely carefully designed and tested to work well with cloud-based backup. Even so, the results may vary dramatically based on the real scenario.
Incremental backup can reduce the amount of data to be backed up in many cases; however, it has the following disadvantages:
DriveHQ has successfully backed VMs for many large enterprise clients. To make the backup process efficient and smooth, we recommend the following best practices. While these are not necessarily required, they can certainly optimize the performance, reliability and dramatically lower the cost.
We recommend separating the OS volume, application volume and data volume, creating three VHDs for OS, apps and data. At a minimum, we recommend creating two VHDs, one for the OS, the other one for the apps and data; or one for the OS and apps, and the other one for the data.
Moving the memory swap file or page file to the apps or data volume can help minimize the amount of data on the OS volume.
With the above approach, you can minimize the VHD file size for the OS volume. For Windows XP or Windows Server 2003, the OS volume only needs about 16GB or even smaller; for Windows 7 / Server 2008 R2, the OS volume only needs about 32GB or even smaller.
With proper preparation, the VHD file containing the operating system can be relatively small (less than 32GB). The file can be easily backed up locally, or backed up to the Cloud Storage. This VHD only needs to be backed once as the OS rarely changes or such changes will not affect your backup or restore.
Backup of the OS VHD may not be necessary as you can reinstall the OS, or use a recovery disk to restore the OS. On the other hand, having the OS VHD handy can reduce the time to restore the OS.
If you do decide to backup your VM’s operating system to DriveHQ cloud storage, you just need to run DriveHQ Online Backup client software from the host server, create a backup task using the Database / Email Backup Wizard, then select the backup source folder to backup the virtual machine configuration files and the OS VHD file, but exclude other VHD files. You can exclude other VHD files from the Manage Tasks à Edit Backup Task window.
If the software applications are installed in the same OS VHD, then it is already backed up with the OS.
Assuming you have the software installation disk, then backing up the apps VHD is not necessary. Again, having the apps VHD ready can reduce the time to restore the software apps.
If you do decide to backup your VM’s software applications to DriveHQ cloud storage, you just need to run DriveHQ Online Backup client software from the host server, create a backup task using the Database / Email Backup Wizard, then select the backup source folder to backup the Apps VHD file.
In this case, a virtual machine is exactly the same as a physical machine. You just need to:
Compared with other cloud backup solutions, DriveHQ’s Virtual Machine Backup solution has the following advantages:
Facebook
20,080
Twitter
1263
Google +
963