
Senior Site Reliability Engineer

Remote (Europe)

$348,000-$414,000 / year

full-time, senior, Remote, document workflow automation

Description

You will own and influence the incident management process, maintain the observability stack, participate in the on-call rotation, develop automations for reliability, and collaborate with product engineers to embed SRE principles. You will work with a remote-first team to ensure PandaDoc's platform runs smoothly for over 67,000 customers.

Requirements

  • Solid programming experience in Python (Django, AsyncIO) and/or Java (Spring Boot)
  • Experience with the LGTM observability suite (Loki, Grafana, Tempo, Mimir)
  • Experience developing and maintaining Python services in production
  • Strong experience with AWS and Kubernetes
  • Proficiency with relational databases (PostgreSQL) and messaging systems (RabbitMQ, NATS, Kafka)

Responsibilities

  • Own and influence the incident management process end-to-end
  • Maintain and evolve the on-prem observability stack
  • Keep production applications running through the on-call rotation
  • Develop automations and tools for platform reliability
  • Collaborate with product engineers to foster SRE principles