This is not an esoteric question. Market research about the disappointing state of backup and data protection shows that recovery failures occur all the time. Anecdotal evidence also points to recovery failures across enterprises of all sizes and industries. We hear from many prospects looking for a new backup and recovery solution that their impetus for replacing their existing tools was an unforeseen recovery failure.
A failed recovery is not a random event. In our 30 years of focusing on backup and cloud DR, we’ve learned to recognize the warning signs of known software conditions and configurations that, if not handled properly will make a recovery unlikely. Also, the lack of good process in a few key areas can increase the likelihood that a recovery will fail.
The good news is that if you know where and what to look for, you can potentially deal with these issues before they become a crisis. To help you assess your risk, Unitrends has created a tool that contains 10 questions to score your risk of experiencing a failed recovery.
Here are two examples of the conditions that can greatly increase your odds of a failed recovery:
Introducing VSS Errors
Volume Shadow Copy Service or VSS is a Microsoft technology that allows users to make manual or automatic snapshots of their data files. These snapshots are then copied to remote drives to be used as backups. Many Windows apps come with their own VSS app included, so many versions will exist on your server at the same time. If more than one VSS process runs in the same environment, there will be a conflict resulting in the failure of one or more VSS Writers. VSS issues can have multiple causes:
When and how do you test recovery
There is nothing you can do to better ensure successful recoveries than regular and automated testing. However, most organizations don’t test frequently or thoroughly enough. To lower the risk of a failed recovery, best-in-class organizations:
These are just two of the categories of issues that can cause recovery failures. To help you assess your risk of experiencing a recovery failure please see our Recovery Assessment Tool and answer 10 questions. Your final score will be a good indicator of potential problems. Based on your risk level, we also suggest steps you can take to reduce the chances your next emergency recovery will fail.
Downtime is potentially a very stressful and chaotic time. This is not the time or climate you want to have to do forensic analysis on why your recovery has failed. Recovery is an example of the idiom “an ounce of prevention is better than a pound of cure.”