Disaster Recovery & Capacity Engineer
Manchester, United Kingdom
Full-time | Mid-level | Enterprise Software
Description
You will design and implement solutions for disaster response and capacity planning to ensure business continuity and efficient resource use. You will collaborate with engineering teams to develop tooling, test recovery plans, and optimize platform capacity.
Requirements
- Strong experience in IT operations or production engineering
- Experience with Linux administration
- Experience with Kubernetes administration
- Proficiency in a scripting language for automation
- Understanding of recovery solutions and high-availability architectures for cloud and on-premises environments
Responsibilities
- Develop and implement capacity planning tooling, frameworks, policies, and strategies
- Build tooling to support disaster recovery plans and track KPI maturity
- Collaborate with Engineering Service Management to ensure DR strategies are adequate and tested
- Provide capacity requirements and impact assessments for new services or changes
- Provide ongoing feedback for risk management, mitigation, and prevention