6h ago

Senior Site Reliability Engineer

Colorado Springs, CO

$180k-$220k / year

full-timeseniorsoftware

🛠 Tech Stack

+2

💼 About This Role

You'll own the reliability, scalability, and security of production deployments for a mission-critical military collaboration platform. You'll implement world-class observability and lead incident response. This role requires working on-site at customer locations in Colorado Springs with an active Top Secret clearance.

🎯 What You'll Do

  • Design and manage observability stack (Prometheus, Loki, Grafana)
  • Define and own SLIs and SLOs for system reliability
  • Lead incident response and blameless post-mortems
  • Automate secure Kubernetes clusters with Terraform and Ansible

📋 Requirements

  • Top Secret clearance active
  • 5+ years in Platform, DevOps, or SRE
  • Kubernetes design and operations expertise
  • Infrastructure as Code with Terraform or Ansible

✨ Nice to Have

  • Experience in DoD compliance (RMF, STIGs, ICD 503)
  • GitOps practices and toolchains
  • Experience designing SLIs/SLOs with error budgets

🎁 Benefits & Perks

  • 💰 Competitive equity package
  • 🏠 Relocation assistance provided
  • 🏖️ All-remote company culture
  • 🌍 Impact on national security missions
0 0 0