5h ago
Senior Reliability Engineer (Data Infrastructure)
Lisbon, Portugal
full-timesenior HybridFinancial Technology (FinTech)
Tech Stack
+2
Description
You will ensure the reliability, availability, and performance of critical data systems on AWS and GCP. Responsibilities include designing and maintaining data infrastructure services, defining SLOs/SLAs, participating in on-call rotations, and automating cloud infrastructure using Terraform and Helm.
Requirements
- Experience as SRE managing cloud infrastructure (AWS/GCP) and data systems (Kafka, Spark, Elasticsearch, PostgreSQL, Cassandra)
- Proven track record improving reliability and availability in production
- Proficiency in Terraform and Helm
- Experience managing distributed databases within Kubernetes
- Strong scripting and automation skills
Responsibilities
- Design and maintain highly available data infrastructure (SQL, NoSQL, Kafka, Spark)
- Define and monitor SLOs/SLAs
- Participate in on-call rotation and incident response
- Automate cloud infrastructure using Terraform and Helm (GitOps)
- Implement monitoring, logging, and tracing solutions
0 views 0 saves 0 applications