1h ago

Site Reliability Engineer - All Levels

Remote

$89,155-$287,488 / year

full-time Remote3D content delivery / Spatial computing

Tech Stack

Description

You will take ownership of front-line operational support for platforms delivering 3D/4D Gaussian splat content to AR/VR devices. Initially focus on making on-call sustainable and effective, then reduce toil through automation and build tooling that lets the team operate at higher levels of the stack.

Requirements

  • 3-5+ years production operations with on-call experience
  • Expertise in instrumenting systems for operational visibility
  • Strong debugging skills across distributed systems
  • Experience with observability tools (Prometheus, Grafana, ELK)
  • Experience with cloud platforms (AWS, CoreWeave) and IaC (Kubernetes, Helm, Terraform)

Responsibilities

  • Participate in on-call rotations for 24/7 availability of streaming services
  • Instrument systems with telemetry, metrics, and logging
  • Design and tune alerting systems to minimize false positives
  • Lead incident response and post-mortems to drive improvements
  • Identify and eliminate toil through automation and runbook development
0 views 0 saves 0 applications