3h ago

Senior Site Reliability Engineer

White Plains, NY

$170k-$200k / year

full-timesenior HybridVeterinary emergency services

🛠 Tech Stack

💼 About This Role

You'll lead platform resilience for DogByte, a proprietary system critical to 24/7/365 emergency veterinary care. Your work will ensure hospital-to-hospital traffic isolation and transform infrastructure into a self-healing system. This role offers the opportunity to impact life-saving operations at a rapidly growing company.

🎯 What You'll Do

  • Formulate strategies for year-over-year volume increases and traffic isolation
  • Configure data flows for high-concurrency and reliability
  • Build automated processes for traffic spikes and error remediation
  • Set up monitoring and alerting to resolve latency proactively

📋 Requirements

  • 5+ years in SRE/DevOps roles with high-concurrency environments
  • Deep understanding of AWS managed through Infrastructure as Code
  • Expertise in traffic management, load balancing, Nginx, and autoscaling
  • Technical leadership in observability frameworks and monitoring

✨ Nice to Have

  • Experience with Python development
  • Familiarity with PostgreSQL and RDS

🎁 Benefits & Perks

  • 💰 Competitive Compensation ($170k-$200k) + bonus
  • 🩺 Comprehensive Health & Wellness starting day one
  • 👶 Paid Parental Leave up to 10 weeks at 100% salary
  • 🏖️ Unlimited PTO
  • 🐾 Bring Your Pet to Work and free lunches twice a week
0 0 0