8h ago
Senior Site Reliability Engineer
Czechia
$150k-$200k / year (est.)
full-time · senior · Remote · software
About This Role
You'll be the reliability backbone of an AI-first data team, keeping deployments, pipelines, and observability dependable for data-intensive services. Your work directly impacts enterprise data platforms and agentic AI workloads. This role offers the chance to shape the next-generation data infrastructure for a leading personalization platform.
What You'll Do
- Build and maintain the reliability ecosystem for DataCraft services on GCP and Kubernetes
- Ensure end-to-end observability across Kafka, Airflow, Databricks, and BigQuery
- Own and evolve Terraform-based infrastructure and CI/CD pipelines
- Participate in L3 on-call rotation and incident resolution
Requirements
- 5+ years of experience in site reliability engineering or DevOps
- Expertise in GCP and Kubernetes
- Proficiency in Terraform for infrastructure as code
- Experience with monitoring and observability tools (Prometheus, Grafana)
Nice to Have
- Experience with data pipelines (Kafka, Airflow, Spark)
- Knowledge of agentic AI platforms (LLM APIs, MCP)
- Familiarity with multi-cloud data platforms (Snowflake, Databricks)
Benefits & Perks
- Unlimited PTO
- Full remote work from home
- Latest tech gear
- Learning budget for conferences and courses
- Wellness program (e.g., gym membership)
Hiring Process
Estimated timeline: 2-4 weeks · AI estimate
1. Recruiter call · 30 min
2. Technical interview · 60 min
3. Hiring manager interview · 45 min