2h ago

Site Reliability Engineer - Media Production Infrastructure

Cupertino, California, United States
full-timeseniorMedia Production

Tech Stack

Description

You will ensure the high availability, performance, and resilience of critical server, storage, and media workflow systems in a world-class media production environment. You'll manage SAN infrastructure, custom monitoring dashboards, and provide on-site support with a 24/7 on-call rotation.

Requirements

  • 14+ years experience with macOS and SAN environments, preferably Xsan
  • Deep expertise in Fibre Channel networking and hardware RAIDs
  • Thorough knowledge of macOS ACLs, POSIX permissions, and Directory Services
  • Experience installing and configuring Prometheus and Grafana
  • Experience with Shell Scripting

Responsibilities

  • Maintain and troubleshoot all production hardware, servers, and storage infrastructure
  • Execute key maintenance and support for the SAN environment
  • Manage custom dashboards for 24/7 monitoring of systems
  • Contribute to development and maintenance of custom applications and dashboards for media workflows
  • Manage Backup and Archive environment, including tape systems and cloud archiving
0 views 0 saves 0 applications