Site Reliability Engineer (Senior or Staff), Atlas
Austin; Boston; Chicago; New York City; Pittsburgh; United States
Full-time, Senior, Remote, Database software
Description
Join the Atlas SRE team to design and build complex systems, create tooling and automation, and maintain the Atlas platform. You'll collaborate with software engineering teams to ensure the reliability and resilience of critical customer applications across multiple clouds.
Requirements
- 5+ years of experience running critical systems at scale
- Familiarity with AWS, Azure, or GCP and multi-cloud systems
- Strong Linux fundamentals
- Proficiency in Go, Ruby, or Python
- Solid understanding of web protocols (HTTP, TLS, DNS)
Responsibilities
- Develop a reliable multi-cloud platform for business-critical applications
- Collaborate with service teams to solve technical challenges and build tooling
- Participate in a 24/7 on-call rotation to ensure high availability