Senior Site Reliability Engineer
London, England, United Kingdom
Full-time, Senior, Hybrid, Financial technology
Description
You will drive architectural improvements to enhance reliability, scalability, and efficiency across ClearScore's platform. You'll lead the evolution of our Kubernetes clusters, troubleshoot complex production issues, design automation tools, and mentor other engineers while collaborating with developers to improve observability and infrastructure.
Requirements
- Expert-level Kubernetes knowledge including cluster upgrades, networking, container runtimes, and node-level troubleshooting
- Strong AWS expertise in architecture, networking, and cost management
- Deep understanding of Linux internals, containerization, and OS-level performance tuning
- Proficiency in at least one compiled language (Go, Rust, C++) and one interpreted language (Python, Bash)
- Experience with CI/CD pipelines and tooling such as Jenkins, ArgoCD, or Spinnaker, including managing large-scale migrations
Responsibilities
- Drive architectural change through RFCs and platform-wide initiatives to improve reliability, scalability, and efficiency
- Lead and evolve the Kubernetes platform: design, upgrade, and optimize clusters at scale
- Troubleshoot and resolve complex production issues in distributed systems and containerized environments
- Design and contribute to Kubernetes controllers and automation tools to improve infrastructure and developer experience
- Enhance the AWS estate, ensuring cost efficiency, security, and scalability while promoting best practices