1d ago
Manager, Software Engineering
Remote US
$200k-$275k / year
full-time · lead · Remote
💼 About This Role
You'll lead Affirm's Resilience Engineering team, ensuring production system reliability through proactive validation techniques like load testing and chaos engineering. Your team builds platforms for safe production experimentation, embedding resilience into the development lifecycle.
🎯 What You'll Do
- Define and drive the vision for resilience engineering, with a focus on production load testing and chaos engineering
- Lead and mentor a team building platforms for safe production experimentation
- Partner with infrastructure and product leadership to embed resilience validation into the SDLC
- Design and evolve platforms enabling safe, controlled production load testing and fault injection
📋 Requirements
- Proven experience leading engineering teams in reliability, infrastructure, or distributed systems
- Hands-on experience with production load testing, chaos engineering, or large-scale system validation
- Experience with a chaos engineering vendor such as Gremlin, Harness, or similar
- Strong understanding of failure modes in distributed systems, including latency, partial failure, and cascading outages
✨ Nice to Have
- Familiarity with cloud-native environments (AWS, Kubernetes) and observability tooling
- Strong programming background (Python, Kotlin, Java, or similar)
- Experience building systems with strong safety guarantees
🎁 Benefits & Perks
- 🏥 100% subsidized medical, dental, and vision coverage for you and dependents
- 💰 Equity rewards and Employee Stock Purchase Plan (ESPP) at a discount
- 🛠️ Flexible Spending Wallets for Technology, Food, Lifestyle, and family-forming expenses
- 🏖️ Competitive vacation and holiday schedules to rest and recharge
📨 Hiring Process
Estimated timeline: 2-4 weeks · AI estimate
- 1. Phone Screen · 30 min
- 2. Technical Interview · 60 min
- 3. Hiring Manager Interview · 60 min