What are the responsibilities and job description for the Manager, IT Disaster Recovery and Incident Management position at Search Services?
Summary
The Manager of IT Disaster Recovery and Incident Management is responsible for overseeing disaster recovery planning and incident response efforts, ensuring business continuity in a large-scale IT environment. This role requires strategic planning, cross-functional leadership, and crisis management expertise, particularly in infrastructure and application recovery. The ideal candidate has a strong IT background, experience in disaster recovery and incident response, and the ability to lead teams under high-pressure situations.
Description
Education:
The Manager of IT Disaster Recovery and Incident Management is responsible for overseeing disaster recovery planning and incident response efforts, ensuring business continuity in a large-scale IT environment. This role requires strategic planning, cross-functional leadership, and crisis management expertise, particularly in infrastructure and application recovery. The ideal candidate has a strong IT background, experience in disaster recovery and incident response, and the ability to lead teams under high-pressure situations.
Description
- Lead and manage disaster recovery planning, ensuring alignment with business continuity and risk management goals.
- Oversee incident management processes, serving as the primary IT contact during crisis events and working with Crisis Management teams to coordinate responses.
- Conduct disaster recovery drills, tabletop exercises, and post-incident reviews, continuously improving response strategies.
- Collaborate with infrastructure, security, and operations teams to ensure the resilience of critical IT systems and applications.
- Develop and maintain a disaster recovery framework, including runbooks, recovery time objectives (RTOs), and recovery point objectives (RPOs).
- Ensure compliance with industry regulations and standards such as PCI DSS, GDPR, and SOX for data protection and disaster recovery.
- Provide regular updates to executives and stakeholders regarding disaster recovery readiness and incident response outcomes.
- Lead cross-functional teams during crisis events, ensuring clear communication and coordinated actions.
- Manage vendor relationships for disaster recovery services, ensuring effective escalation processes for third-party dependencies.
- Prepare project charters, identify stakeholders, and oversee disaster recovery projects from planning to execution.
- Stay updated on emerging technology trends and threats, ensuring disaster recovery plans remain relevant and effective.
Education:
- Bachelor’s degree in Computer Science, Information Technology, Business Management, or a related field.
- Industry certifications in IT project management, disaster recovery, or incident management (e.g., PMP, CISSP, ITIL) preferred.
- 7-10 years of experience in IT project management, including large-scale infrastructure and application recovery projects.
- Extensive experience in disaster recovery planning, implementation, and testing, ensuring minimal downtime and data loss.
- 3-5 years of experience in incident management, leading responses to cybersecurity threats, service outages, and natural disasters.
- Experience in a large retail environment with knowledge of distribution, logistics, and omnichannel systems is a plus.
- Familiarity with business continuity planning, scenario-based recovery exercises, and crisis communication strategies.
- Strong knowledge of disaster recovery and incident management frameworks, best practices, and industry standards.
- Expertise in project management methodologies, including Agile, Waterfall, and hybrid models.
- Strong problem-solving and risk mitigation skills, with a proactive approach to identifying and addressing threats.
- Demonstrated experience in strategic planning, budget forecasting, and service delivery improvement.
- Exceptional communication skills, with the ability to convey complex information to technical and non-technical stakeholders.
- Strong leadership and decision-making abilities, particularly in crisis scenarios.
- Proficiency in project management and business continuity tools (e.g., Microsoft Project, Jira, or equivalent platforms).
- Familiarity with cloud infrastructure, virtualized environments, and data replication technologies.
- Ability to remain calm and focused under pressure, providing clear guidance and leadership in emergency situations.
- Highly organized and detail-oriented, capable of managing multiple tasks and projects simultaneously.
- Self-motivated, with strong time management and prioritization skills in a fast-paced environment.
- Regular attendance required, with adherence to company policies and professional standards.