2h ago
Site Reliability Engineer
San Francisco
$130k-$500k / year
full-timeseniorai-ml
🛠 Tech Stack
💼 About This Role
You'll own production reliability for critical systems at Mercor, a Series C company defining the future of AI-powered work. You'll partner with infrastructure leadership to build the SRE function from the ground up and shape how we operate large-scale, high-availability systems.
🎯 What You'll Do
- Own reliability for core shared services and customer-facing systems.
- Define SRE priorities and reliability standards with infrastructure leadership.
- Improve production system structure for stability and observability.
- Champion SRE practices like incident response and SLIs/SLOs across teams.
📋 Requirements
- 5+ years of true SRE experience (not just operations).
- Deep familiarity with Google SRE practices (error budgets, risk trade-offs).
- Proven success operating large-scale distributed systems.
- Ability to drive cultural change around reliability while staying hands-on.
✨ Nice to Have
- Experience as a founding SRE or early SRE hire.
- Hands-on experience with AWS, Kubernetes, and IaC tooling.
🎁 Benefits & Perks
- 💰 Equity included.
🚩 Heads Up
- Salary range is very wide ($130K-$500K), indicating potential lack of role clarity.
0 0 0