Senior Software Engineer
Menlo Park, CA
$196k-$230k / year
Full-time · Senior · Hybrid · Finance
About This Role
You'll join the new Robinhood Command Center (RCC) team, serving as the frontline for production incident detection and mitigation. Your core impact will be defining incident response processes and building reliability tooling across the company. This high-visibility role allows you to shape how Robinhood learns from incidents at scale.
What You'll Do
- Lead incident mitigation and coordinate service owners during active incidents
- Develop and maintain incident management processes and procedures
- Own global dashboards and alerts tied to critical user journeys
- Drive post-incident governance and follow-up tracking for reliability improvements
Requirements
- 5+ years of software engineering experience
- 2+ years focused on reliability engineering or production operations
- Hands-on experience in an incident leadership role (e.g., incident commander)
- Deep knowledge of observability frameworks like OpenTelemetry, Prometheus, Grafana
Nice to Have
- Experience with multi-region or multi-cluster architectures
- Familiarity with capacity planning and failover strategies
- Demonstrated ability to improve MTTD or MTTR
Benefits & Perks
- Performance-driven compensation with multipliers, bonus, equity, and 401(k) matching
- 100% paid health insurance for employees, 90% for dependents
- Lifestyle wallet for wellness, learning, and more
- Mental health benefits and fertility benefits
- Exceptional office experience with catered meals and events
Hiring Process
Estimated timeline: 2-4 weeks (AI estimate)
1. Recruiter Screen · 30 min
2. Technical Interview · 60 min
3. Hiring Manager · 45 min