Virtualization is great for saving money and resources, but if you lose a virtual server you have larger bigger problems than if you had just lost a single server. While no company wants to put their business at risk by under-protecting their VMs, nobody likes to spend more than they have to on non-productive expenditures like insurance. The Small Business Administration divides businesses into two categories: those who have experienced a disaster and those who are about to experience one, and a study by CIO.com revealed that it costs 42 percent of companies $1,000 per hour of downtime and 26 percent of companies $10,000 per hour of downtime – with upper ranges at over $50,000 per hour.
For companies that are trying to figure out the best way to protect your virtual machines, here are a few things to consider:
First, in order to avoid killing a mosquito with a baseball bat, figure out your cost of downtime. These can help you (and the executives funding the bat) understand how much data and time you can afford to lose if your VMs are interrupted or if they fail all together.
To figure out how much you have to lose, think about how much money the company would lose if all the transaction data and emails from the last twelve to twenty-four hours were lost, if there are compliance risks with not being able to produce data, and what it would cost to have employees recreate the last twelve to twenty-four hours of data.
Or, if you want to know exactly what an hour of downtime will cost you, you can use a simple formula to figure it out:
Cost Per Occurrence = (TO + TD) x (HR + LR)
TO = Length of Outage
TD = Time Delta (the length of time since your last backup)
HR = Hourly Rate of Personnel (monthly expense per department divided by the total number of work hours)
LR = Lost Revenue per Hour (applies if the department generates profit – a good rule is to look at profitability over three months and divide by the number of work hours)
Second, before deciding how to protect your data, it is important to consider what the data is being protected from. Four reasons why companies often protect their data are listed below:
- Loss of a single resource – In this scenario, a single important resource fails or is interrupted. For example, losing a virtual server that end users depend on for product ordering would cripple a business that depends on electronic procurement. Likewise, many businesses would be seriously affected by the loss of one of their primary e-mail servers. Planning for this case usually means providing both backup and availability for the virtual server.
- Loss of user data files – This unfortunately common scenario involves the accidental or intentional loss of important data files. The most common solution is to restore the lost data from a backup, but this can involve going back to a previous snapshot of the server – often with data loss.
- Planned outages for maintenance or migration – The goal of planned maintenance or migrations on virtual machines is usually to restore or repair or patch service. Migrations usually mean users won’t have access to applications and data on virtual machines while the migration is in progress and is tested. If you can tolerate downtime and your IT staff doesn’t mind putting in a night or weekend, the most basic migration software will do. If you require availability and for users to remain online during a migration, more advanced software will allow you to perform migrations with no service interruption.
- Loss of an entire facility – In this scenario, entire facilities and all of their resources are unavailable as a result of natural disasters, extended power outages, failure of the facility’s environmental conditioning systems, fire, flood or any other disaster our outage that takes out power or a physical structures. In this situation, it’s best to resume operations at another physical site. The amount of downtime you can tolerate will determine if you require just backup or backup and availability.
Few companies can tolerate days of downtime so they feel good about having an availability and disaster recovery plan in place, but don’t feel so great about sorting through the options. The good news is that if you’ve done your downtime homework, it’s not so bad.
Option 1: Tape backup and recovery
Tape backup can provide for the long-term archival needs of virtual servers, it’s portable, fairly secure, and the cost per megabyte is low. Tape backup is right for you if you can tolerate a day or more of downtime while you rebuild a whole VM or a whole server, if you have staff that won’t forget to switch tapes, if you aren’t regulated, and if you don’t have a lot to lose in a day’s worth of data.
Option 2: Replication-based backup and recovery
Backup software installs on your production and backup virtual servers and replicate data anywhere from once-a-day to “snapshots” or point-in-time recovery to continuous, real-time options. If your production server fails, you’ve got a copy of your data (and sometimes applications) waiting on you. If you can make a home for your backup server outside of your zip code you’ll avoid building and local disasters. If you can get it out of your region you’re even better protected. When evaluating products, look for hardware-independent capabilities – that way if your production server is actually destroyed you can recover to a different make and model and won’t have to go looking for an exact copy of the old server. Software-based backup is a great option if you can tolerate a few hours of downtime, don’t want to constantly manage the tape backup process (or can’t, as in the case of branch offices) and can’t afford to lose too much data.
Option 3: Add availability
The granddaddy of these options is software-based backup and recovery that includes continuous access to data and applications on your virtual machines in any situation. It’s usually not much more expensive than simple backup and recovery software, but the right product provides a whole other level of comfort if you can’t afford to lose data or time. Most availability options don’t offer a whole lot of granularity, but there are some that offers immediate failover and recovery. A comprehensive product in this category offers everything backup and recovery software does, except if your virtual server melts down, the backup server steps in immediately with a current copy of your data and applications. With good quality software, the switch happens so seamlessly that your end users won’t notice anything’s gone wonky in the server room.
Option 4: The cloud
The newest, and maybe coolest, kid on the block is backup and recovery in the cloud. Simply by installing cloud software on your VMs and renting a chunk of space from a cloud provider, you can have a secure virtual datacenter for a fraction of the cost of a physical datacenter. If you’re choosing cloud recovery, it’s probably a good idea to pick a software / storage combo that allows you to recover, or start up applications and data, from the cloud.
If you’re running VMs your eggs are all in one basket, so you’ve got to have a backup and recovery plan. Once you figure out how much downtime you can tolerate, you can set a realistic budget and pick the right solution for putting that plan in place.
Robin Howard is a Technology Anthropologist and project manager at Double-Take Software.