2h ago
Site Reliability Engineer - Media Production Infrastructure
Cupertino, California, United States
Full-time, Senior, Media Production
Description
You will ensure the high availability, performance, and resilience of critical server, storage, and media workflow systems in a world-class media production environment. You'll manage SAN infrastructure and custom monitoring dashboards, and provide on-site support as part of a 24/7 on-call rotation.
Requirements
- 14+ years of experience with macOS and SAN environments, preferably Xsan
- Deep expertise in Fibre Channel networking and hardware RAID
- Thorough knowledge of macOS ACLs, POSIX permissions, and Directory Services
- Experience installing and configuring Prometheus and Grafana
- Experience with shell scripting
Responsibilities
- Maintain and troubleshoot all production hardware, servers, and storage infrastructure
- Perform key maintenance and support tasks for the SAN environment
- Manage custom dashboards for 24/7 monitoring of systems
- Contribute to the development and maintenance of custom applications and dashboards for media workflows
- Manage the backup and archive environment, including tape systems and cloud archiving