Senior Reliability Engineer (Data Infrastructure)

Lisbon, Portugal
Full-time, Senior, Hybrid, Financial Technology (FinTech)

Description

You will ensure the reliability, availability, and performance of critical data systems on AWS and GCP. Responsibilities include designing and maintaining data infrastructure services, defining SLOs/SLAs, participating in on-call rotations, and automating cloud infrastructure using Terraform and Helm.

Requirements

  • Experience as an SRE managing cloud infrastructure (AWS/GCP) and data systems (Kafka, Spark, Elasticsearch, PostgreSQL, Cassandra)
  • Proven track record of improving reliability and availability in production
  • Proficiency in Terraform and Helm
  • Experience managing distributed databases within Kubernetes
  • Strong scripting and automation skills

Responsibilities

  • Design and maintain highly available data infrastructure (SQL, NoSQL, Kafka, Spark)
  • Define and monitor SLOs/SLAs
  • Participate in the on-call rotation and incident response
  • Automate cloud infrastructure using Terraform and Helm (GitOps)
  • Implement monitoring, logging, and tracing solutions