8h ago

Senior Site Reliability Engineer

Czechia

✨ $150k-$200k / yearest.

full-timesenior Remotesoftware

πŸ›  Tech Stack

+3

πŸ’Ό About This Role

You'll be the reliability backbone of an AI-first data team, ensuring deployments, pipelines, and observability for data-intensive services. Your work directly impacts enterprise data platforms and agentic AI workloads. This role offers the chance to shape the next-generation data infrastructure for a leading personalization platform.

🎯 What You'll Do

  • Build and maintain reliability ecosystem for DataCraft services on GCP and Kubernetes
  • Ensure end-to-end observability across Kafka, Airflow, Databricks, and BigQuery
  • Own and evolve Terraform-based infrastructure and CI/CD pipelines
  • Participate in L3 on-call rotation and incident resolution

πŸ“‹ Requirements

  • 5+ years of experience in site reliability engineering or DevOps
  • Expertise in GCP and Kubernetes
  • Proficiency in Terraform for infrastructure as code
  • Experience with monitoring and observability tools (Prometheus, Grafana)

✨ Nice to Have

  • Experience with data pipelines (Kafka, Airflow, Spark)
  • Knowledge of agentic AI platforms (LLM APIs, MCP)
  • Familiarity with multi-cloud (Snowflake, Databricks)

🎁 Benefits & Perks

  • πŸ–οΈ Unlimited PTO
  • 🏠 Full remote work from home
  • πŸ’» Latest tech gear
  • πŸ“š Learning budget for conferences and courses
  • πŸ‹οΈ Wellness program (e.g., gym membership)

πŸ“¨ Hiring Process

Estimated timeline: 2-4 weeks Β· AI estimate

  1. 1Recruiter callΒ· 30 min
  2. 2Technical interviewΒ· 60 min
  3. 3Hiring manager interviewΒ· 45 min
0 0 0