2h ago

Site Reliability Engineer

San Francisco

$130k-$500k / year

full-timeseniorai-ml

🛠 Tech Stack

💼 About This Role

You'll own production reliability for critical systems at Mercor, a Series C company defining the future of AI-powered work. You'll partner with infrastructure leadership to build the SRE function from the ground up and shape how we operate large-scale, high-availability systems.

🎯 What You'll Do

  • Own reliability for core shared services and customer-facing systems.
  • Define SRE priorities and reliability standards with infrastructure leadership.
  • Improve production system structure for stability and observability.
  • Champion SRE practices like incident response and SLIs/SLOs across teams.

📋 Requirements

  • 5+ years of true SRE experience (not just operations).
  • Deep familiarity with Google SRE practices (error budgets, risk trade-offs).
  • Proven success operating large-scale distributed systems.
  • Ability to drive cultural change around reliability while staying hands-on.

✨ Nice to Have

  • Experience as a founding SRE or early SRE hire.
  • Hands-on experience with AWS, Kubernetes, and IaC tooling.

🎁 Benefits & Perks

  • 💰 Equity included.

🚩 Heads Up

  • Salary range is very wide ($130K-$500K), indicating potential lack of role clarity.
0 0 0