Disaster Recovery & Capacity Engineer

Manchester, United Kingdom
Full-time, mid-level, Enterprise Software

Description

You will design and implement solutions for disaster response and capacity planning to ensure business continuity and efficient resource use. You will collaborate with engineering teams to develop tooling, test recovery plans, and optimize platform capacity.

Requirements

  • Strong experience in IT operations or production engineering
  • Experience with Linux administration
  • Experience with Kubernetes administration
  • Proficiency in a scripting language for automation
  • Understanding of recovery solutions and high-availability architectures for cloud and on-premises environments

Responsibilities

  • Develop and implement capacity planning tooling, frameworks, policies, and strategies
  • Build tooling to support disaster recovery plans and track KPI maturity
  • Collaborate with Engineering Service Management to ensure DR strategies are adequate and tested
  • Provide capacity requirements and impact assessments for new services or changes
  • Provide ongoing feedback for risk management, mitigation, and prevention