8h ago

Senior Site Reliability Engineer

Costa Mesa, California

$166k-$220k / year

full-timesenior

๐Ÿ›  Tech Stack

+2

๐Ÿ’ผ About This Role

You'll ensure the reliability and performance of autonomous systems for mission-critical defense applications. Your work directly enables software to operate flawlessly in cloud, hardware-in-the-loop, and air-gapped environments. You'll bridge development and operations, tackling challenges where failure is not an option.

๐ŸŽฏ What You'll Do

  • Manage and expand on-premises developer servers and HITL systems
  • Design and maintain highly available fault-tolerant autonomous systems
  • Eliminate performance bottlenecks for low-latency real-time operations
  • Develop monitoring, logging, tracing, and alerting solutions
  • Automate provisioning, deployment, testing, and recovery tasks

๐Ÿ“‹ Requirements

  • 5+ years in Site Reliability Engineering or DevOps for mission-critical apps
  • Strong proficiency in Python or Go
  • Experience with Ansible, Puppet, or Terraform
  • Deep expertise with Linux operating systems and command-line skills

โœจ Nice to Have

  • Experience with edge computing or mesh networks
  • Familiarity with Splunk or similar monitoring tools
  • Prior experience in defense, aerospace, or robotics

๐ŸŽ Benefits & Perks

  • ๐Ÿฅ Comprehensive health coverage at little to no cost
  • ๐Ÿ’ฐ Highly competitive equity grants included in offers
  • ๐Ÿ–๏ธ Paid time off and flexible schedules

๐Ÿ“จ Hiring Process

Estimated timeline: 2-4 weeks ยท AI estimate

  1. 1Recruiter Phone Screenยท 30 min
  2. 2Technical Interviewยท 60 min
  3. 3Hiring Manager Interviewยท 45 min
0 0 0