8h ago
Senior Site Reliability Engineer
Costa Mesa, California
$166k-$220k / year
full-timesenior
๐ Tech Stack
+2
๐ผ About This Role
You'll ensure the reliability and performance of autonomous systems for mission-critical defense applications. Your work directly enables software to operate flawlessly in cloud, hardware-in-the-loop, and air-gapped environments. You'll bridge development and operations, tackling challenges where failure is not an option.
๐ฏ What You'll Do
- Manage and expand on-premises developer servers and HITL systems
- Design and maintain highly available fault-tolerant autonomous systems
- Eliminate performance bottlenecks for low-latency real-time operations
- Develop monitoring, logging, tracing, and alerting solutions
- Automate provisioning, deployment, testing, and recovery tasks
๐ Requirements
- 5+ years in Site Reliability Engineering or DevOps for mission-critical apps
- Strong proficiency in Python or Go
- Experience with Ansible, Puppet, or Terraform
- Deep expertise with Linux operating systems and command-line skills
โจ Nice to Have
- Experience with edge computing or mesh networks
- Familiarity with Splunk or similar monitoring tools
- Prior experience in defense, aerospace, or robotics
๐ Benefits & Perks
- ๐ฅ Comprehensive health coverage at little to no cost
- ๐ฐ Highly competitive equity grants included in offers
- ๐๏ธ Paid time off and flexible schedules
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter Phone Screenยท 30 min
- 2Technical Interviewยท 60 min
- 3Hiring Manager Interviewยท 45 min
0 0 0