Business Continuity/Disaster Recovery Plan Development

  Windows IT Pro
Windows IT Library
  - Advertise        
Windows IT Pro Logo

  Home  |   Books  |   Chapters  |   Topics  |   Authors  |   Book Reviews  |   Whitepapers  |   About Us  |   Contact Us  |   ITTV  |   IT Jobs

search for  on    power search   help
 






Business Continuity/Disaster Recovery Plan Development
View the book table of contents
Author: Susan Snedaker
Published: June 2007
Copyright: 2007
Publisher: Syngress Publishing
 


As with previous chapters, we’ll begin with a review of where we are in this process (see Figure 6.1). Creating the BC/DR plan entails putting together the information you’ve developed so far and adding a bit more detail.


 

INTRODUCTION

The bulk of your work in developing your business continuity and disaster recovery plan is complete when you get to this point. Granted, you may be reading this book through from start to finish before developing your plan (recommended) and therefore you will have none of the actual work completed. However, things move quickly in the business world and there are some of you who are doing the work as you read each chapter. Either way, this is where everything comes together. The risk analysis you performed led you into your vulnerability assessment. That data helped you develop an assessment of the impact various risks would have on your business. Finally, you took all your data and identified mitigation strategies— actions you could take to avoid, reduce, transfer, or accept the various risks you found. With that, you now have to develop a plan that takes your mitigation strategies and identifies both methods for implementing those strategies, and people, resources, and tasks needed to complete these activities.

In Chapter 7, we’ll go over emergency activities including disaster response and business recovery, so we’ll refer only briefly to those elements in this chapter where appropriate. In Chapter 8, we’ll discuss training and testing and in Chapter 9, we’ll discuss maintaining the plan. All of these are elements that should be included in your BC/DR plan as well.

The plan basically needs to state the risks, the vulnerabilities, and the potential impact to each of the mission-critical business functions. For each of these, there should be associated mitigation strategies. In some cases, there will be multiple mitigation strategies; in other cases, you may have elected to simply accept the risk. However, all of this should be clearly laid out in your documentation thus far. Next, you need to determine how and when those strategies are implemented and by whom.

Your work breakdown structure will look something like this:
  1. Identify risks (complete).
  2. Assess vulnerability to risks (complete).
  3. Determine potential impact on business (complete)
  4. Identify mission-critical business functions (complete).
  5. Develop mitigation strategies for mission-critical functions (complete).
  6. Develop teams.
  7. Implement mitigation strategies.
  8. Develop plan activation guidelines.
  9. Develop plan transition guidelines.
  10. Develop plan training, testing, auditing procedures.
  11. Develop plan maintenance procedures.
As you can see from this simplified list, you should already have items one through five completed. We’ll discuss developing teams in this chapter as it relates to carrying out the BC/DR plan, not the planning team that you should already have in place (and who hopefully have helped you accomplish tasks one through five). We’ll cover developing plan activation and transition guidelines in this chapter before heading into Chapter 7. At the end of this chapter, you’ll have items one through nine complete (or will understand how to complete them when you begin project work).

As with previous chapters, we’ll begin with a review of where we are in this process (see Figure 6.1). Creating the BC/DR plan entails putting together the information you’ve developed so far and adding a bit more detail. We’ll create the BC/DR plan document in this chapter, but keep in mind we’ll have to circle back later to add detail that we develop in upcoming chapters.


PHASES OF THE BUSINESS CONTINUITY AND DISASTER RECOVERY

Hopefully you’ll never need to put your BC/DR plan into action, despite all the hard work you put into it. If you do need to use your plan, however, you’ll need to have clear and specific guidelines for how and when to implement it. Let’s begin with a quick look at the phases of the plan: activation, disaster recovery, business resumption or business continuity, and transition to normal operations.

Activation Phase

The activation phase of your BC/DR plan addresses the time during and immediately after a business disruption. In this section of your plan, you need to define when your BC/DR plan will be activated and in what manner. You don’t want to activate your plan for every little glitch your business runs into, so you’ll need to develop a clear set of parameters that you can use to determine if or when to activate your BC/DR plan. In addition, you will need to define how your plan is activated, including who has the authority to activate it and what steps that person (or persons) will take to initiate BC/DR activities.

Activation includes initial response and notification, problem assessment and escalation, disaster declaration, and plan implementation. After you have begun implementing the plan, you proceed into the recovery phase, as shown in Figure 6.2.

It is in this activation phase that you should define various disaster or disruption levels so that you know when, if, and how to implement your plan. For example, if you experience a network security breach, you’ll have to activate different phases of your plan than if the server room is flooded. Therefore, defining various disaster types and levels is important in understanding what should trigger the implementation of BC/DR plans. You may choose to use a three-level rating system, as described here. However, make sure that whatever system you devise, it’s tailored to your specific business configuration and that it gives you the guidance you’d need to make these crucial decisions based on predetermined and agreed-upon criteria.

Major Disaster or Disruption
The possibility or likelihood of this type of disaster occurring is low but the business impact is extremely high. This event disrupts all or most of the normal business operations of the company and all or most of its critical business processes. The disruptions occur because all or a majority of systems and equipment have failed or are inaccessible. This includes destruction to the entire facility; a major portion of the facility; or entire networks, subnets, or sections of the business. Once you’ve defined what this level of disaster or disruption entails, you should define the process for determining which parts of your BC/DR plan should be activated and which team members should be called upon. We’ll discuss triggers more in a moment; for now you should attempt to define the business systems, mission-critical functions, and major operations that when affected would cause a major disruption. This will help you develop appropriate triggers to determine when and how to activate your BC/DR plan.

Intermediate Disaster or Disruption
An intermediate disaster is likely to occur more frequently than a major disaster, but less frequently than a minor disaster (hence the "intermediate" designation). Its impact will be less than a major and more than a minor event. This type of disruption or disaster interrupts or impacts one or more mission-critical functions or business units, but not all of them. Operations will experience significant disruption, entire systems or multiple systems may fail or be unavailable, but not all of them. An intermediate event could include a fire or flood in the building that impacts IT systems and equipment, structural damage to part of the building where critical operations occur or where vital equipment is located. As with the other two levels of disruption, it’s important to define not only what each tier consists of but which parts of the BC/DR plan should be activated and which team members should begin implementing BC/DR activities. As with a major disruption, clearly delineate which systems, functions, and operations would be impacted to earn an intermediate designation so you can define triggers that will address these types of situations.

Minor Disaster or Disruption
Minor disruptions occur every day in the business world and rarely, if ever, are BC/DR plans called into action. The likelihood of a minor event occurring is high, the associated disruption is low. The effects typically are isolated to one component, one system, one business function, or just one segment of a critical business function. Normal operations can often continue, almost uninterrupted, in the face of a minor disruption. Critical business functions still occur for some period of time after this type of disruption. The failure of a single system or service can typically be addressed during the normal course of business. For example, the failure of a single server, system disk, or phone system is problematic but usually does not require the activation of a BC/DR plan. There may be examples, however, where minor disruptions should be addressed by the activation of part of a BC/DR plan. If that is the case, be sure to clearly identify those disruptions along with which sections of the BC/DR plan should be implemented when and by whom.

Activating BC/DR Teams
Clearly, the BC/DR plan cannot activate itself, someone or a team of people need to make appropriate assessments of the situation and make a determination as to whether or not to activate the plan or portions thereof. Therefore, it’s also important to create and maintain various BC/DR teams that handle the response to the business disruption by implementing appropriate sections of the BC/DR plan. We’ll discuss the makeup of these teams later in this chapter, but for now we’ll list some of the BC/DR teams you may want to define and populate as you continue in this planning process.
  • Crisis management team
  • Damage assessment team
  • Notification team
  • Emergency response team
  • Business continuity coordinator or lead
  • Crisis communication team
  • Resource and logistics team
  • Risk assessment manager
Depending on the size and nature of your company, you may or may not need some of these functions. It’s also possible that one person may fill one or more roles if you’re working in a small company. We’ll discuss these roles in more detail in a section coming up later in this chapter.

Developing Triggers
If you’re familiar with project management, you’re probably familiar with triggers. Typically, risks and triggers are identified so that if a project risk occurs, a trigger defines when an alternate plan or method should be implemented. The same is true here. If you are going to implement your plan, you’ll need to define how and when that should occur—those are your triggers. For example, if you use the three categories of major, intermediate, and minor, you’ll need to define what actions are taken in each case. Each level of disruption should have clearly defined triggers. Let’s look at a hypothetical example. You’re the IT manager of a small firm and the head of the BC/DR team. You’re at home one evening just sitting down to dinner when one of the data processing operators who works until 9 P. M. calls you. She reports that there was a fire in the building, it’s been evacuated, and the fire department is on the scene. You ask her a series of questions and ascertain that the fire seems to have been contained relatively quickly but that some of the networking gear may have been damaged either by the fire or by the fire containment efforts. She believes the server room is in tact but she’s not sure. If you have clearly defined triggers in place, you may determine that this appears to be either a minor or an intermediate disruption and that you should most likely activate a portion of your BC/DR plan. The trigger might be defined as a series of steps such as:
  1. Business disruption event has occurred.
  2. Disruption to business operations has occurred.
  3. Initial assessment by employees on the scene indicates intermediate level damage, including the following:
    • A portion of the network is or may be out of service.
    • One or more critical servers are or may be out of service.
    • A portion of the physical facility has been impacted by the disruption.
    • It is likely employees will not be able to resume normal operations within two hours.
This is an example of a trigger you could define for intermediate types of events. As you’ve done previously, using scenarios helps you define these elements more clearly. By defining three statements and four attributes, you have a good understanding of whether or not to activate the BC/DR plan for intermediate outages. You also have a defined timeline— if normal business operations cannot resume within two hours. This should be tied to your overall maximum tolerable downtime (MTD) and other recovery metrics developed earlier. If your MTD is 24 hours, an intermediate disruption might be something that will disrupt normal operations for two to six hours. You and your team will need to define these various windows, but be sure to tie your triggers to your recovery metrics.

Your intermediate activation steps are related to the trigger. Once you know you should activate your plan, you should define the immediate steps to be taken. This helps remove any uncertainty about next steps and helps begin a focused response effort. An example of the first steps for an intermediate disruption is shown here.
  1. If a disruption appears to be intermediate on initial assessment, within two hours:
    • Attempt to gather information from the emergency responders, if appropriate.
    • Activate the damage assessment team.
    • Notify the crisis management team to be on standby notice.
  2. After two hours from event notification, gather initial evaluation from damage assessment team.
  3. After three hours, notify crisis management team of next steps (stand down, fully activate).
  4. Within three hours of event notification, BC/DR plan should be implemented if assessment indicates intermediate or major disruption.
Notice, though, that our description of the actual disruption levels includes trigger information. How many systems are impacted? How extensive is the damage? The more clearly you can define these details, the more precise your triggers will be, and this will help you determine if and when to activate your plan. Spend time clearly defining the circumstances that will warrant plan activation at the various levels you’ve defined and also spend time defining initial steps to be taken in each phase so that you have checklists of next steps. We’ll provide additional checklists you can use as starting points for your own lists when we go over disaster recovery steps in the next chapter as well as in the appendix materials at the end of this book.

Transition Trigger—Activation to Recovery
Another trigger to define is when to move from one phase to another. In this case, that means when to move from the activation phase to the recovery phase. This transition is one that typically occurs fairly naturally, so you don’t need to over-engineer this. However, you may want to define the transition trigger like this:
  1. The damage assessment team’s initial evaluation indicates an intermediate disruption.
  2. The crisis management team has been called in and is on scene.
  3. The immediate cause of the event has stopped or been contained.
  4. The intermediate section of the BC/DR plan has been activated.
You may wish to define other triggers for your transition, from activation to recovery, suitable to your organization. When defining your triggers throughout, keep your maximum tolerable downtime (MTD) and other defined metrics in mind so that you can work within those constraints. For example, if your MTD is very short, your time between activation and recovery also should be very short. In this case, you may have to err on the side of timeliness and take action with incomplete or preliminary data. You’ll have to balance your need to collect information with your need to get the business back up and running as quickly as possible (and within your MTD constraints). Rarely, if ever, is there perfect data in an emergency (or any other time). Defining these triggers and constraints clearly in your plan can help you make better decisions in the stressful aftermath of a business disruption or disaster. Help the team make the best decisions possible by spending time now to define these triggers as clearly and unambiguously as possible.

Recovery Phase

The recovery phase is the first phase of work in the immediate aftermath of the disruption or disaster. This phase usually assumes that the cause of the disruption has subsided, stopped, or been contained, but not always. For example, in the case of flooding, you may decide that if it’s external flooding, you will wait until waters subside to begin recovery efforts. This may be required by local officials who restrict access to flooded areas. However, in other cases, you may be able to or choose to initiate recovery efforts while flooding is still occurring. This might include placing sandbags around the entryways to the building or removing equipment that is not yet under water. As you can tell, many of your actions will be dictated by the specifics of the situation, so there’s no simple rule to follow here. However, we can say that recovery efforts have to do with recovering from the immediate aftermath of the event, whether or not the event is still occurring. This phase may also include evacuating the facility, removing equipment that can be salvaged quickly, assessing the situation or damage, and determining which recovery steps are needed to get operations up and going again. The recovery phase is discussed in detail in Chapter 7.

Transition Trigger—Recovery to Continuity
You’ll learn more about recovery activities in Chapter 7, so you’ll need to circle back and define these triggers after you understand the information covered in that chapter. At this juncture, you can make a note that you need to develop triggers that help you know when to transition from recovery efforts to business continuity efforts. Typically, these triggers will have to do with determining that the effects of the disruption have been addressed and are not getting any worse. For example, if you experience a fire in the building, the fire is out, the assessment has been done, any equipment or supplies that can be salvaged have been, and alternate computing facilities have been activated. Those are activities that take place in the recovery phase and when these are all complete, it’s time to move into the business continuity phase, which typically includes starting up systems so that business operations can resume. Defining these points should include specific events that have occurred, milestones that have been met, or time that has elapsed. Also keep your MTD in mind as you define triggers for this transition.

Business Continuity Phase

The business continuity phase kicks in after the recovery phase and defines the steps needed to get back to "business as usual." For example, if you have a fire in the building, the recovery phase might include salvaging undamaged equipment, ordering two new servers from a hardware vendor, and loading up the applications and backup data on the servers at a temporary location so that you can begin to recover your data and your business operations. The business continuity phase would address how you actually begin to resume operations from that temporary location, what work-arounds need to be implemented, what manual methods will be used in this interim period, and so forth. The final steps in the business continuity phase will address how you move from that temporary location to your repaired facility, how you reintegrate or synchronize your data, and how you transition back to your normal operations. This detail is discussed in Chapter 7. You’ll also need to define triggers here that define when you end business continuity activities and when you resume normal operations. Again, as with the other triggers, you should strive to be as clear and concise as possible. You’ll have enough to deal with later if you do end up activating and implementing your plan, so spend time here to save yourself a headache later on.

Although it might seem intuitive that you’ll resume normal operations when everything is back to normal, things sometimes do not return to normal after a business disruption of any magnitude. Certainly, business operations will resume, but some things may change permanently as a consequence of this disruption. For example, your company may decide as a result of a major fire or flood that it wants to move to a new location and it’s going to do that while operating from the alternate site. That would complicate things because it would mean moving from the alternate site to a new site, with all the concomitant challenges inherent in both resuming normal operations and moving to a new facility. Though this example may seem outside the bounds of normal business decision-making, be assured that disruptions can change the way companies see their businesses and the way they approach operations. Another example is developing a work-around that’s used in the recovery phase that works so well that someone decides to use it full time. When do you transition back to normal operations if you incorporate BC/DR work-arounds? When do you officially transition back to normal operations if you decide that the new server role or network configuration actually works better than the original? It might be a simple matter of formally evaluating the change, agreeing to make it permanent, and declaring you’re now running under normal operating conditions. You and your team can define these triggers in advance and you may need to modify them later but at least you won’t be working with a blank slate.

Maintenance/Review Phase

The maintenance phase has to occur whether or not you ever activate your BC/DR plan. On a periodic basis, you need to review your BC/DR plan to ensure that it is still current and relevant. As operations and technology components change, as you add or change facilities or locations, you’ll need to make sure that your plan is still up to date. One common problem in BC/DR planning is that companies may expend time to develop a plan but they often do not want to (or will not) expend the time and resources necessary to keep the plan current. Old plans are dangerous because they provide a false sense of security and may lead to significant gaps in coverage. If a plan is not maintained, then all the time and money invested in creating the plan is wasted as well. In addition, if you end up activating your BC/DR plan at some point, you’ll want to assess the effectiveness of the plan afterward, when things settle down. You should do this relatively close to the end of the recovery and business continuity cycles so that lessons learned can be captured and applied to your BC/DR plan before memories fade and people go back to their daily routines. Reviewing the plan in the immediate aftermath of a disruption will give you valuable insights into what did and did not work. Incorporating this knowledge into your plan will help you continue to hone the plan to meet your evolving business needs. This is discussed further in Chapter 9.


DEFINING BC/DR TEAMS AND KEY PERSONNEL

There are numerous people in positions that are critical to the activation, implementation, and maintenance of your BC/DR plan. Although these may not all be relevant to your organization, this will serve as a good checkpoint to determine who should be included in your various phases. You’ll also need to form teams to fulfill various needs before, during, and after a business disruption or disaster. Where possible, you should specify a particular position or role that meets the need rather than specifying individuals. If your Facilities Manager should participate in the Damage Assessment Team, for example, you should specify the Facilities Manager and not Phil, who happens to be the Facilities Manager now. That will allow your plan to remain relevant whether Phil wins the lottery and leaves the company, gets hit by a bus and is out for an extended period of time, or is promoted to vice president.

Though we briefly define types of teams and their roles in the BC/DR effort, you should take time to clearly define the roles and responsibilities of each team. Having clear boundaries will help ensure that teams are not working at cross-purposes and that all aspects of the plan are covered. Gaps and omissions occur when these kinds of definitions are illformed. If helpful, you can create team descriptions that read like job descriptions and you can task members of your HR department on the BC/DR team to assist with or lead this activity. A good team description will identify the following attributes:
  • Positions or job functions included on the team (Facilities Manager, HR Director, etc. )
  • Team leader and contact information
  • Team mission statement or set of objectives
  • Scope of responsibilities (define what is and is not part of this team’s mission)
  • Delineation of responsibilities in each phase of BC/DR (i. e. , when will the team be activated and deactivated?)
  • Escalation path and criteria
  • Other data, as needed
Crisis Management Team

In most companies, the composition of crisis management team will mirror the organizational chart. It should have representatives from across the organization and should bring together members of the company who have the expertise and authority to deal with the after-effects of a major business disruption. The crisis management team (CMT) will decide upon the immediate course of action in most cases and when necessary, they can contact senior management. They will direct the distribution and use of resources (including personnel) and will monitor the effectiveness of recovery activities. They can adjust the course of action, as needed. They should be in charge of activating, implementing, managing, and monitoring the business continuity and disaster recovery plan and should delegate tasks as appropriate.

Management

Each company has a management team or structure that oversees the business and its operations. You’ll need to determine which positions from your management team should be included in your plan. Remember to review all the phases. For example, you might decide that only a member of the management team can cause the BC/DR plan to be activated.

Management might be required to decide when to transition from disaster recovery to business continuity activities or they might be the one(s) to decide how and when the BC/DR plan should be tested. Identify the positions that should participate as well as define how they should participate in each phase.

Damage Assessment Team

A damage assessment team should be comprised of people from several key areas of the company, including Facilities, IT, HR, and Operations. Your company’s damage assessment team may contain other members, depending on how the company is structured and what type of business you’re in. If you work in a small software development firm, you may just need the CEO, the IT manager, and the office manager to operate as the damage assessment team. In larger companies with multiple locations, you’ll need to have several damage assessment teams or you may choose to create a mobile team that can fly to any site and assess damage within 24 hours of an incident. You may choose to have both a local and a mobile corporate team so that the right team can be called in. If the building floods, you may not need the mobile team to come in. However, if you have a large fire, earthquake, or other major event, you may need the support services of a mobile damage assessment team.

Operations Assessment Team

You may choose to have a separate operations assessment team comprised of individuals who can assess the immediate impact on operations. A damage assessment team may be tasked with this job, but in some types of companies, you may need a separate operations team that can assess what’s going on with operations and how to proceed. The operations assessment team can also be tasked with beginning recovery phase activities, monitoring and triggering the transition from activation to recovery, recovery to business continuity, and BC to normal operations.

IT Team

Clearly, you need an IT team that can not only assess the damage to systems, but can begin the disaster recovery and business continuity tasks once the plan is activated. This IT team will work closely with the damage assessment team and/or the operations assessment team to determine the nature and extent of damage, especially to IT systems and the IT infrastructure. You may not need some of the technical specialties listed here, but this should be a good starter list for you to work from to determine exactly what expertise you’ll need on your team.
  • Operating system administration
  • Systems software
  • Server recovery (client server,Web server, application server, etc. )
  • LAN/WAN recovery
  • Database recovery
  • Network operations recovery
  • Application recovery
  • Telecommunications
  • Hardware salvage
  • Alternate site recovery coordination
  • Original site restoration/salvage coordination
  • Test Team
Administrative Support Team

During a business disruption, there are a wide variety of administrative tasks that must be handled. Creating an administrative support team that can respond to the unique needs of the situation as well as provide administrative support for the company during the disruptions is important. This might include ordering emergency supplies, working with vendors arranging deliveries, tracking shipments, fielding phone calls from the media or investors, organizing paper documents used for stopgap measures, and more.

Transportation and Relocation Team

Depending on the specifics of your BC/DR plan and the type of company you work in, you may need to make transportation arrangements for critical business documents, records, or equipment. You may need to move equipment in advance of an event (like a hurricane or flood) or you may need to move equipment after the event to prevent further damage or vandalism. Relocating the company and its assets before or after a disruption requires a concerted effort by people who understand the company, its relocation needs, and transportation constraints.

Media Relations Team

You may recall that in Chapter 1, we mentioned the need to create a crisis communication plan because you will need to provide information about the business disruption/disaster to employees, vendors, the community, the media, and investors. One key area that should be well-prepped is media relations. Unlike other stakeholders mentioned, the media makes its living selling interesting stories. Since a disruption at your business may qualify as news, you might as well craft the message rather than leaving it to outsiders. Creating a team that knows how to handle the media in a positive manner and that understands the policies and procedures related to talking with the media is vital to help ensure your company’s image and reputation are maintained to the greatest extent possible. Certainly, if your company is at fault, you will have to deal with a different set of questions than if your company experiences a natural disaster. Still, you’ll need to manage the story either way.

Human Resources Team

The aftermath of a crisis is an incredibly stressful time for all employees. Having an HR team in place to begin handling employee issues is crucial to the well-being of the employees and the long-term health of the company. Retaining key employees, adequately addressing employee concerns, facilitating insurance and medical coverage, and addressing pay and payroll issues are part of this team’s mission. This team may also be responsible for activating parts of the BC/DR team as it relates to hiring contract labor, temporary workers, or staff at alternate locations.

Legal Affairs Team

Whether your legal experts are internal or external to your company, you should identify who needs to address legal concerns in the aftermath of a business disruption or emergency. If you hire outside counsel to assist you with legal matters, you should still assign an internal resource as the liaison so that legal matters will be properly routed through the company. If you operate in a heavily regulated industry such as banking, finance, or health care, you should be well aware of the constraints you face, but having a legal affairs team can assist in making decisions that keep your company’s operations within the bounds of laws and regulations. Even if you’re not in a heavily regulated industry, you may need advice and assistance in understanding laws and regulations in your recovery efforts.

Physical/Personnel Security Team

In the aftermath of a serious business disruption, you will need a team of people who address the physical safety of people and the building. These might be designated Human Resource representatives, people from your facilities group, or both. If you work in a large company or in a large facility, you may have a separate security department or function that manages the physical and personnel security for the building. If this is the case, designated members of their team should be assigned to be part of the BC/DR team. If you don’t have a formal security staff, be sure that the members of this ad hoc team receive training. Someone from HR or facilities might be willing to take on the role of security in the aftermath of a disaster, but they need to be trained as to the safest, most effective method of managing the situation. Training for part-time or ad hoc security teams is crucial because if a natural disaster strikes, emergency personnel such as your fire or police department will focus on helping schools, day care centers, nursing homes, and hospitals first. Your company may fall very low on the list of priorities, so having trained staff that can fill the gap in an emergency may literally mean the difference between life and death. We’ll discuss training later in the book, but keep this in mind as you develop your teams.

Procurement Team (Equipment and Supplies)

Every company has some process in place for procuring equipment and supplies. In small companies, this might fall to the office manager or operations manager. In larger companies, there’s usually a purchasing department that handles this function. Regardless of how your company is organized, you need to determine, in advance, how equipment and supplies will be purchased, tracked, and managed after a localized disaster such as a fire or in the aftermath of a widespread disaster such as a hurricane or earthquake. This includes who has the authority to make purchases and from whom, what dollar limit the authority carries, and how that person (or persons) can get authority to make larger purchases. For example, a company might specify that three people have the authority to purchase equipment and supplies up to $2,000 per order and up to $20,000 total. Beyond that, they have to have the president or vice president sign off on purchases. This predetermined purchasing information can also be communicated to key vendors so they know the three people who are authorized and what the authorization limits are. In this way, if disaster strikes, the company can turn to trusted vendors who, in turn, know the rules. This can expedite the recovery process.

Keep in mind that this team needs to be large enough that there is no "single point of failure." If you authorize only one person and something happens to that one person, you’ll be scrambling to obtain emergency authorization for other individuals. Instead, authorize enough people to provide flexibility but not so many as to create chaos. Also, be sure your limits are appropriate to the type of business you run. If you may need to replace computers at $1500 a piece, make sure the limits reflect that. If a purchaser has a $1,000 limit per item, that will preclude him or her from making a simple purchase needed to get the company running again.

General Team Guidelines

Though we recommend populating teams first with needed skills based on roles and positions within the company, we also recognize that ultimately people are assigned to the team. People should be chosen to be on teams based on their skills, knowledge, and expertise, not because someone wants to be on a team or because someone’s boss placed them on a team. In a perfect world, you could choose team members solely on competence, but we all know that in the real world, that’s not always the case. Occasionally, you get the people who have the most time on their hands, who sometimes are the junior members of the team, or the least competent people in the department. You have to work within your organization’s constraints and culture, but also strive to populate your teams with the right people with the right skills. Ideally, these are the same people who perform these functions under normal conditions. It doesn’t make sense to have the database administrator take on media relations duties during an emergency, just as you don’t want the marketing VP managing the restoration of the CRM database, if possible. Certainly, in small companies many people are called upon to perform a variety of tasks and if that’s the case, the same will be true if the BC/DR plan has to be activated. The teams also should be large enough that if one or more members of the team are unable to perform their duties, the team can still function. If you have other personnel or other parts of the organization that can wholly take up the BC/DR activities, so much the better. If not, you may also choose to designate key contractors or vendors to assist as alternates in the event of a catastrophic event. These personnel should be coordinated and trained as alternates along with internal staff.

Looking Ahead...
Specialty Vendors Help BC/DR Plans
There are numerous specialty vendors that can provide tremendous assistance to your firm in the event of a business disruption such as a fire or chemical spill. Although the numbers and types of firms in your area will vary, you should consider your specific needs in advance of any disruption and search for a firm that will meet your needs, even if that firm is located across the country. These firms provide a wide and unusual assortment of services, some of which are listed here:
  • Chemical oxidation
  • CO2 blasting
  • Condensation drying
  • Contact cleaning
  • Corrosion removal
  • Damp blasting
  • Degreasing
  • Deodorizing
  • Fogging for odor removal or disinfection
  • Manual hand wiping
  • High pressure and ultra-high pressure jetting
  • High temperature steam jetting
  • Hot air drying
  • Low pressure jetting
  • Microwave drying
  • Ozone technology
  • Sanitation
  • Steam blasting
  • Vacuum drying
  • Water displacement
As you can see, this is quite a list and it’s not exhaustive. Be sure to think through the various scenarios that apply to your firm and determine which specialty services might best be outsourced to a qualified third party. You’ll save yourself time and money in the long run and you’ll likely get up and running much more quickly with targeted, competent help than if you try to do everything on your own.

BC/DR Contact Information

After you’ve developed the requirements for your teams in terms of the specific skills, knowledge, and expertise needed, you’ll identify the specific people to fill those roles. Part of plan maintenance, discussed later in this book, involves ensuring that the key positions are still in the BC/DR loop and that key personnel are still aware of their BC/DR responsibilities.

Another mundane but crucial task in your planning work is to compile key contact information. Since computer systems often are impacted by various types of business disruptions— from network security breaches to floods and fires—you’ll need to have contact information stored and available in electronic and hard copy. It should be readily available at alternate locations and copies should be stored in off-site locations that can be accessed if the building is not accessible. However, since this list contains contact information, it should also be treated as confidential or sensitive information and should be handled and secured as such. This information should include contact information for key personnel from the executives of the company (who will need to be notified of a business disruption) to BC/DR team members to key suppliers, contractors, and customers, among others.

Develop a list of the types of contact information you need, including:
  • Management
  • Key operations staff
  • BC/DR team members
  • Key suppliers, vendors, contractors (especially those with whom you have BC/DR contracts)
  • Key customers
  • Emergency numbers (fire, police, etc. )
  • Media representatives or PR firm (if appropriate)
  • Other
After you’ve identified the contact information you want to include, you’ll need to determine where and how this information currently is maintained. In most companies, this information is stored in a multitude of locations and is not easily compiled with a few clicks of the mouse. You may need to develop a process for maintaining an up-to-date list, both electronically and on paper, of these key contacts. For example, many of your key contacts may be in a contact management application made available to everyone in the company. However, information such as executives’ cell phone numbers and home phone numbers may not be included in this companywide contact database, for obvious reasons (especially if you work in a medium to large company). Therefore, you’ll need to have a copy of the contact information plus information not included there. Developing a process for gathering and maintaining that data is an important part of BC/DR readiness. If a serious business disruption occurs in the middle of the night—for example, the building catches on fire—who will you contact? How will you know who to contact? Where will you find the key phone numbers you need if you can’t get back into the building and you can’t access computer systems? Since notification is one of the first steps in activating your BC/DR plan, you’ll need to have key phone numbers available (you meaning the person(s) responsible for activating the plan). Develop a process for this during your BC/DR planning project and make sure that your maintenance plan includes regularly updating this information.

In addition to developing and maintaining a contact list, you should also define a contact tree. This defines who is responsible for contacting other teams, members of the company, or the management team. That way, each team member is tasked with specific calls to specific people and the notification process is streamlined.

Common Challenges
Maintaining Up-to-Date Contacts
Maintaining up-to-date contact information can be a challenge, especially since that information seems to change so frequently. If you work in a small company, you may task your office manager or other administrative support staff with maintaining this list and preparing an updated list once per month or once per quarter, storing it in designated locations and distributing it to key personnel. In larger companies, this task becomes a bit more difficult as contact information typically becomes fragmented— the contacts needed by the marketing group are not the same contacts needed by the IT group. Therefore, you may choose to have departmental responsibility for maintaining key contacts relevant to that business function. If you choose that route, be sure you still have someone with high-level BC/DR responsibility who oversees the maintenance of BC/DR contact information, which in this example would include the departmental representatives who have the contact information for their units. The master BC/DR contact list should be maintained by someone on the BC/DR team and should include, at minimum, contact information for key executives, department heads, regional managers (other locations), and key BC/DR vendors, contractors, and suppliers. Regardless of the method you choose for managing your contact information, be sure that it includes a process for regularly updating it. Also update the contact tree once the contact list is revised.


DEFINING TASKS, ASSIGNING RESOURCES

The tasks and resources that need to be assigned have to do both with implementing the mitigation strategies you’ve defined as well as fleshing out the rest of the plan. First, you have to ensure your risk mitigation strategies will be properly implemented. This may mean creating project plans to address any new initiatives you need to undertake in order to meet your risk mitigation requirements. We’ll assume you’ve got that covered as part of your risk mitigation strategy. If not, now’s the time to develop your work breakdown structure, tasks, resources, and timelines for completing any risk mitigation strategies that need to be completed in advance of a disruption. This might include purchasing and installing new uninterruptible power supplies for key servers, updating your fire suppression systems, or implementing a data vaulting solution. Other mitigation strategies such as arranging for an alternate site need to be completed in advance, but activating it requires a different set of tasks that occur later. Finally, strategies that include accepting risk mean there probably are no additional tasks at this time.

Other tasks have to do with defining your BC/DR teams, roles, and responsibilities; defining plan phase transition triggers; and gathering additional data. Let’s start with tasks related to some major activities including alternate sites and contracting for outside BC/DR services. Clearly, there are other tasks and resources you’ll need, but this should get you started in developing your own list of tasks, budgets, timelines, dependencies, and constraints for the remaining BC/DR activities in your plan.

As you develop these tasks, keep in mind standard project management processes:
  1. Identify high-level tasks, use verb/noun format when possible (i. e. , "test security settings" rather than "security settings").
  2. Break large tasks into smaller tasks until the work unit is manageable.
  3. Define duration or deadlines.
  4. Identify milestones.
  5. Assign task owners.
  6. Define task resources and other task requirements.
  7. Identify technical and functional requirements for task, if any.
  8. Define completion criteria for each task.
  9. Identify internal and external dependencies.
We’re not going to go through all that detail for these next two high-level tasks, but you should include this level of detail in your plan.

Alternate Site

Although this should be part of your BC/DR plan, it’s worth calling it out separately due to its importance and the need for advance work. If part of your risk mitigation strategy is to develop an alternative site or off-site storage solution, you should develop a number of details before moving forward. These should be tasks (or subtasks) within the WBS just discussed, so let’s look at some of the details you might include. Also keep in mind that you need to develop a trigger that helps you determine if or when you fire up the alternate site. You probably don’t want to activate the alternate site if you have a minor or even an intermediate disruption, so how do you define when you should? When all systems are down or when some percentage of systems down? You have to take your MTD into consideration along with other factors such as the cost of firing up the alternate site and the cost of downtime. If your downtime is estimated to be 12 days and that cost is $500,000 but the cost of firing up the alternate site is three days and $250,000, is it worth it to activate the alternate or should you just hobble along until you can restore systems at the current location? There’s no right or wrong answer, it’s going to depend on your company’s MTD, potential revenue losses, cost of starting up the alternate site, and so on. Have the financial folks on your BC/DR team prepare some analyses to determine metrics you can use to help determine your trigger point. As you’re going through the activities listed in this section, keep these factors in mind.

Selection Criteria
Selection criteria are the factors you develop to help you determine how to select the best alternate site solution for your company. This includes cost, technical and functional requirements, timelines, quality, availability, location, and more. Be sure to consider connectivity and communications requirements in this section along with your recovery requirements such as maximum tolerable downtime.

Contractual Terms
Determine what contractual arrangements are appropriate for your company. Many vendors have predetermined service offerings and contracts are fairly standard. Other companies can accommodate a wider range of options and will work with you to develop appropriate contractual language. In either case, be sure to run these contracts past your financial staff and your legal counsel to make sure you are fully aware of the financial and legal consequences of these contracts in advance of signing them. If you’re not clear what they mean operationally, be sure to talk with the vendor and add clarifying language to the contract. Do not simply take the vendor’s word that a particular paragraph or section mean something. Verbal agreements are always superseded by written contracts, so make sure the contract spells it out clearly. You don’t want to rely on verbal commitments made by employees no longer with the company when it comes to implementing your BC/DR solutions, so be sure to put everything in writing in advance.

Comparison Process
Be sure to specify what process you’ll use to select the vendor. This might include a list of technical requirements the vendor must meet, but it might also include an assessment of the vendor’s geographical location, financial history, and stability and industry expertise, among other things. Selecting the right vendor for an alternate site or off-site storage is a very important aspect to your BC/DR success and should be undertaken with the same rigor as your other planning activities.

Acquisition and Testing
Once you’ve selected your alternate site or off-site storage vendor and completed the contract, you will need to make whatever additional arrangements are needed for developing this solution so that it is fully ready in the timeframe you’ve designated. This might include purchasing additional hardware and software, setting up communications channels, and testing all solutions implemented. Create a thorough acquisition and testing plan for this phase so you can transition to it as seamlessly as possible in the event of a business disruption. During your testing phase of the BC/DR plan (see Chapter 8), you should test the process for firing up this solution on a periodic basis.

Contracts for BC/DR Services

Although we highly recommend you involve your purchasing, finance, and/or legal professionals in executing your BC/DR contracts, you should also have a general understanding of some of the elements to consider. As with alternate site considerations, keep your MTD, your costs, and potential losses in mind. Have your financial folks help you with performing financial analyses to determine what makes financial and business sense for your company. If a firm wants to charge you $50,000 for some sort of contract but your downtime estimate with associated revenue and collateral loss is only $40,000, the contract might not be worth entering. Additionally, determine your triggers for calling upon these contractual arrangements so you don’t prematurely fire up these contracts or avoid using them during times when they should be activated.

Develop Clear Functional and Technical Requirements
You know from project management fundamentals that developing functional and technical requirements is often what defines the difference between success and failure. The same is true here. If you have not clearly and fully defined your functional and technical requirements, you’ll get all kinds of vendor responses. The more specific you are, the more fully a vendor can address your needs. In addition, if you leave too many elements open to discussion, you’ll endlessly discuss possibilities without being able to identify appropriate solutions. Have these discussions in advance, and then come to a firm agreement about the requirements. If some requirements appear to be optional or "nice to have," then list them as options and not as requirements. Pare down your requirements to the elements you absolutely must have. Remember, the more options you include, the higher the cost is likely to be. Therefore, if cost is an issue (and it almost always is an issue), be sure to list what you require and what you desire as separate items. When this information has been finalized, write up formal requirements documents that you can provide to potential vendors. Also, be sure that your requirements documents are reviewed by subject matter experts, including IT experts and those in your company who understand regulatory, legal, and compliance issues. Your requirements should meet all these needs before going out to the vendors.

Determine Required Service Levels
Service levels are typically part of technical requirements, but we’ve listed them separately because they are vitally important when developing Requests for Proposal (RFP) or Requests for Quote (RFQ) from vendors. You may have contractual obligations to provide certain levels of service to your customers, so you may need to specify requirements for your vendors that meet or exceed these metrics. Even if you have no externally facing service level agreements (SLA), you should still specify SLAs in your contracts with vendors. If you’re contracting for Internet connectivity, you should specify bandwidth, minimum upload and download speeds, and maximum downtime per specified period, for example. These may sound like technical requirements, but let’s look at how this can play out in a contract. You write up your requirements, which include bandwidth, minimum upload/download speeds, and maximum downtime. Three vendors respond to your RFQ to provide backup Internet connectivity to your company in the event of an outage from your main vendor or in the event that your company’s facilities are damaged. All three companies give you quotes that indicate they can meet or exceed those three requirements (bandwidth, speed, availability). However, those are not contractual terms, those are the company saying they can meet or exceed those metrics. If it’s not in the contract, it’s just a statement of capabilities, not a commitment. A service level agreement will specify minimum bandwidth availability during a 24-hour period, 7 days a week. It would state that you will have access to [insert bandwidth metric] 24 hours a day, seven days a week until [insert termination metric or trigger]. This way, the vendor can’t provide you the bandwidth you requested only from 11 P. M. to 6 A. M. on Saturdays and Sundays and short you the rest of the time. Granted, most vendors are on the level and want to provide the services to you they’ve agreed upon, but that’s why contracts exist—to clearly define who does what, when, and at what cost. This keeps the guess work (and the finger pointing) to a minimum.

Compare Vendor Proposal/Response to Requirements
Once you receive vendor responses to your proposals, you should evaluate how closely each vendor comes to meeting the requirements of your plan. Any vendor that does not meet the requirements should not be considered further. There may be two exceptions to this. First, if your requirements are unique enough that no single vendor can meet your needs, you may have to circle back and find two or more vendors who can work together to meet your unique requirements. Second, you may discover from vendor responses that your requirements were too broad, inclusive, or vague, and that none of the vendors’ responses meet your requirements exactly. In that case you may have to refine your requirements and go back out for bid. Assuming your requirements are well written, your next step is to eliminate vendors that cannot meet your needs and focus only on those vendors who addressed your requirements fully in their responses.

Identify Requirements Not Met by Vendor Proposal
If there are one or more requirements not met by any vendor, you may need to find two or more vendors to work together to provide the full range of services you need. If none of the vendors met a particular requirement, you may also choose to review that requirement and reassess it in light of vendor responses. Remember, you contract with vendors in order to leverage their specific expertise. If none of them meet a particular requirement, you may wish to talk with several of your short-list selections to find out why they did not address that aspect. It might be redundant or otherwise unneeded. In that case, you should revise your requirements to reflect this new information.

Identify Vendor Options Not Specified in Requirements
Vendors also may offer additional options not specified in your requirements. Again, based on the vendor’s expertise, they may offer additional choices that can round out your requirements or plan. Utilizing their expertise can be a good way of ensuring you have the best solution in place. For example, the vendor might say (in essence),"Everyone who’s asked for A, B, and C also has found that D was an extremely important option they’d overlooked. Perhaps you’d like to add D to your plan as well."They may be sharing industry expertise and best practices with you, or they may simply be trying to up-sell you. You’ll have to look carefully at these options and perhaps do some independent research to determine whether these options are "must have,""nice to have," or "useless add-ons." If you have an established relationship with these vendors, they’ll more than likely offer you additional options but won’t put pressure on you to upgrade unless they feel it’s vital to your success. However, we all know there are sales people that will try to sell you every option they can think of just to make a bigger sale, so you have to be an active participant in the transaction. Know your options, know what makes sense, do some additional research, and determine if any of the additional options would enhance your plan or fill in gaps you didn’t realize existed. Don’t be forced into upgrades and options you don’t really need just because you have a very persuasive sales person in front of you.

From the Trenches...
Managing the Sales Process
Sometimes your purchasing department manages the purchase of goods and services, but when you’re talking about the purchase of backup, storage, or alternate site services for your BC/DR plan, there’s a good chance you will be directly involved. If you haven’t been involved with the sales process in the past, you might find yourself being swayed by excellent sales people—the ones who can convince you that you need something you really don’t. Most sales people are honest and are trying to balance their need to sell with your need for the product or service they’re selling. They also realize that loyal customers are borne out of an honest sales experience, not out of strong-arming someone into purchasing more than they need. In order to be successful in the process, take time to be clear about your objectives before a sales meeting. If you intend on making a purchasing decision at that time, write down the terms or parameters you will accept. Keep these to yourself but know that this is your bottom line. If the sales person cannot or will not meet your bottom line objectives, there is no deal to be struck.

The same holds true in any negotiation—know what your bottom line is and work toward meeting (or exceeding) that bottom line. If you have developed clear requirements and you know your bottom line, you should be able to successfully navigate the sometimes tricky sales process. Negotiation skills can help you in all aspects of life and they’ll certainly help you in the business world. If you’re interested in learning more about the art of negotiation, there are thousands of helpful books, courses, and seminars you can turn to for more information.


Page: 1, 2

next page



Windows IT Pro Home Register FAQ for Windows WinInfo News
Europe Edition About Us Contact Us/Customer Service Media Kit Affiliates / Licensing  
SQL Server Magazine Office & SharePoint Pro Windows Dev Pro IT Job Hound ITTV
IT Library Technology Resource Directory Connected Home Windows Excavator Windows SuperSite 
 
 Windows IT Pro is a Division of Penton Media Inc.
 Copyright © 2008 Penton Media, Inc., All rights reserved. Terms and Use | Privacy Statement | Reprints and Licensing