Senior Site Reliability Engineer
White Plains, NY
$170k-$200k / year
Full-time, Senior, Hybrid, Veterinary emergency services
💼 About This Role
You'll lead platform resilience for DogByte, a proprietary system critical to 24/7/365 emergency veterinary care. Your work will ensure hospital-to-hospital traffic isolation and transform infrastructure into a self-healing system. This role offers the opportunity to impact life-saving operations at a rapidly growing company.
🎯 What You'll Do
- Formulate strategies for year-over-year volume increases and traffic isolation
- Configure data flows for high-concurrency and reliability
- Build automated processes for traffic spikes and error remediation
- Set up monitoring and alerting to catch and resolve latency issues proactively
📋 Requirements
- 5+ years in SRE/DevOps roles with high-concurrency environments
- Deep understanding of AWS infrastructure managed through Infrastructure as Code
- Expertise in traffic management, load balancing, Nginx, and autoscaling
- Technical leadership in observability frameworks and monitoring
✨ Nice to Have
- Experience with Python development
- Familiarity with PostgreSQL and RDS
🎁 Benefits & Perks
- 💰 Competitive Compensation ($170k-$200k) + bonus
- 🩺 Comprehensive Health & Wellness starting day one
- 👶 Paid Parental Leave up to 10 weeks at 100% salary
- 🏖️ Unlimited PTO
- 🐾 Bring Your Pet to Work and free lunches twice a week