2h ago
Software Engineer, Engineering Ops
New York, NY
full-timeseniorFinancial Technology
Tech Stack
Description
You will build and automate the operational foundation for Ridgeline's platform, focusing on incident response, observability, FinOps, and compliance. Collaborate with SRE and product engineers to reduce toil and improve system reliability at scale.
Requirements
- 5+ years in SRE, DevOps, or Production Engineering
- Operational automation with Python, Go, or Bash
- Deep understanding of observability stacks like Datadog, Prometheus, ELK, or OpenTelemetry
- Practical experience with FinOps dashboards and cost tagging strategies
- Hands-on experience with incident response and root cause analysis
Responsibilities
- Design and implement automation for operational workflows like tenant provisioning and patch coordination
- Drive incident response and post-incident improvement through runbooks and root cause automation
- Lead design and implementation of unified observability frameworks
- Build and maintain dashboards and telemetry pipelines for FinOps cost optimization
- Define and manage system manifests with ownership, dependencies, and operational metadata
0 views 0 saves 0 applications