2h ago

Software Engineer, Engineering Ops

New York, NY
full-timeseniorFinancial Technology

Tech Stack

Description

You will build and automate the operational foundation for Ridgeline's platform, focusing on incident response, observability, FinOps, and compliance. Collaborate with SRE and product engineers to reduce toil and improve system reliability at scale.

Requirements

  • 5+ years in SRE, DevOps, or Production Engineering
  • Operational automation with Python, Go, or Bash
  • Deep understanding of observability stacks like Datadog, Prometheus, ELK, or OpenTelemetry
  • Practical experience with FinOps dashboards and cost tagging strategies
  • Hands-on experience with incident response and root cause analysis

Responsibilities

  • Design and implement automation for operational workflows like tenant provisioning and patch coordination
  • Drive incident response and post-incident improvement through runbooks and root cause automation
  • Lead design and implementation of unified observability frameworks
  • Build and maintain dashboards and telemetry pipelines for FinOps cost optimization
  • Define and manage system manifests with ownership, dependencies, and operational metadata
0 views 0 saves 0 applications