2h ago
Database Reliability Engineer - Core Team
United Kingdom
full-timesenior RemoteCloud Computing / Data Analytics
Tech Stack
Description
You will build and lead processes to ensure the reliability, availability, scalability, and performance of ClickHouse. You will collaborate with multiple teams, manage escalation and incident response, and drive continuous improvement of ClickHouse in the cloud.
Requirements
- Bachelor's or Master's degree in Computer Science or related field
- At least 5 years of experience in Reliability Engineering or related field
- Experience operating ClickHouse or other SQL databases in production
- Scripting experience with Shell or Python, ability to read C++ code
- Knowledge of cloud computing platforms (AWS, Azure, GCP)
Responsibilities
- Continuously improve reliability and performance of ClickHouse core
- Improve and create metrics and alerts to prevent production issues
- Investigate root causes of customer problems and submit bug fixes or improvements
- Enhance incident response and post-mortem analysis for ClickHouse core outages
- Plan and drive Chaos initiatives across engineering teams
0 views 0 saves 0 applications