12-17-14 | Blog Post
Data is growing exponentially. Not just in size, but also in importance to organizations. So how do you protect that data while simultaneously assuring it can be quickly recovered in case of an emergency?
A recent Online Tech “Tuesdays at 2” webinar discussed how organizations can turn offsite backup into a recovery option.
In “Transforming Your Offsite Backup into a Real Recovery Option,” guest host Steven Aiello led an in-depth dive into what he calls the “data protection spectrum” and how companies should – and should not – use products on that spectrum to create a better offsite backup and recovery solution.
Aiello was formerly a Senior Product Engineer at Online Tech and currently works as a Technical Architect for Chicago-based IT consulting firm AHEAD, where he helps build infrastructure and security solutions for large organizations in the financial, health care and other assorted verticals.
Aiello discussed the following data protection spectrum: availability, replication, backup and storage. While discussing the pros and cons of each category, Aiello stressed that all of them can play a key role depending on an organization’s needs.
“It’s not that any one is better or worse than the other,” he said. “It’s using the right tool for the right job, and how all of these pieces plays into your data protection strategy.”
He added later that a single organization could find that it needs to be using every entry on the spectrum. “Most likely, for a large organization, that’s exactly where they need to be.”
Below is a brief recap of Aiello’s points. For a deeper dive, a replay of the webinar is available.
By definition, availability means one thing: Applications never go down. It also means another thing: A huge capital investment. Aiello says that in exchange for six 9s – or 99.9999% uptime – companies need to be willing to hand over a six-figure check.
But it could very well be worth the price. Aiello uses the example of a trading house that is trading billions of dollars of securities each day. That same company can lose tens of millions of dollars if an application is down for minutes.
Availability has a very specific business use case, so we won’t spend a lot of time on it here. But Aiello does note one distinct limitation: The laws of physics. “The fastest that we can shoot data across the globe is at the speed of light. If your application can only tolerate a couple of milliseconds of latency, the distance from point A to point B starts to come into play.”
Replication serves another specific use case; reducing recovery point objective (RPO) and recover time objective (RTO) scenarios at a moderately expensive – yet considerably less than availability – price tag.
Aiello said the benefit of replication is two-fold:
First, data centers can be located further apart because the replication from data center A to data center B is not synchronous.
Also, unlike availability, replication technology does allow for a moderate amount of data retention. Still, Aiello stresses that neither availability nor replication are the equivalent of data backup. Asking a company in the health care industry, as an example, to keep medical records for 21 years in a replicated model is not feasible. “It becomes too expensive,” Aiello said.
Skipping over “Backup” on the data protection spectrum for a moment – since backup was the primary focus of the webinar – Aiello described archiving as a business deciding “the likelihood of this data being relevant, past 30 days, is almost nonexistent.”
An archiving system is the cheapest on the spectrum because a business may never need the data. If it does, it will pay for the retrieval. That’s a fair tradeoff considering the likely of needing it is minimal.
Aiello calls backup “the sweet spot” in data protection because it’s relatively inexpensive when compared to the replication and availability models but allows organizations to start building out a recovery plan.
What is an appropriate backup solution? Along with being granular, able to restore specific files and having a catalog, Aiello pointed to the following “must-haves”:
Encryption. “That was something we took a great, great deal of pride here at Online Tech, in working together. It was always about protecting the customer data. That goes from the primary storage platforms that the production systems run on, all the way to those backups as they leave the production environment. They are encrypted in transit, and they’re encrypted at rest.”
Data durability. “Backups should be durable. When you pick a backup solution … you are essentially betting the company on how reliable that backup technology is. It’s a big, big bet that you’re making.
“When I look at a backup solution, the first question I ask, ‘What’s your data durability rating? Has your product literally ever lost data, because of a malfunction?’”
While defining backup, Aiello found it important to discuss what backup is not. “I’m seeing some people in the industry try to use a couple of different things to be backup, which it’s not designed for.”
The first is enterprise file sync & share (EFSS) products like DropBox or Box.com. There is no revision history in these products. Having multiple copies of any file that is only a point in time does not constitute a backup. It constitutes multiple copies of a file, but it doesn’t allow you to step back in time to recover corrupt data.
The second is snapshots, a tool used as a feature of a replication strategy that cannot be confused for being a backup. First, if an administrator is running servers with a bunch of snapshots on them, there will be a decrease in performance. They’re not designed to be kept around for a long time. Also, they’re not granular and there’s no catalog. “If you have to find one file out of three-quarters of a million, and it’s been given some sort of random label, your RTOs and RPOs just explode,” he said. Finally, snapshots offer no encryption.
DISASTER RECOVERY WORKSHOPS
Want to hear more from Aiello and other disaster recovery experts? Join us at a one of our upcoming half-day workshops scheduled to be held at our Indianapolis Data Center on Thursday, Dec. 22 and our Metro Detroit Data Center on Thursday, Feb. 19. Attendance at both workshops is limited to 50, so register now!
Aiello will be joined on our expert panel by Baseline Data Services founder and president Lance Thompson and Online Tech Director of Infrastructure Nick Lumsden. They’ll present the following agenda at both workshops: