1h ago

Software Engineer, Site Reliability

New York City; San Francisco, CA

$160,000-$300,000 / year

full-timeseniorArtificial Intelligence

Tech Stack

Description

You will own critical production systems end-to-end, designing, building, and improving them through production-quality code. Your work will keep the platform reliable at scale, influence architecture, and build internal tooling for all engineers at Hebbia.

Requirements

  • 5+ years software development with production services
  • Proficiency in Go, Python, C++, or Rust
  • Experience as SRE or Production Engineer owning services end-to-end
  • Deep understanding of distributed systems and container orchestration
  • Cloud platform fluency (AWS preferred)

Responsibilities

  • Own production services end-to-end from design to incident response
  • Profile and rewrite hot paths to eliminate bottlenecks
  • Lead incident response and drive post-mortem culture with code changes
  • Design and build observability frameworks with custom instrumentation
  • Define SLOs and build feedback loops for engineering accountability
0 views 0 saves 0 applications