4h ago
Operations Engineer
Las Vegas, Nevada
✨ $70k-$90k / year (est.)
full-time · junior · ai-ml
💼 About This Role
You'll join the Global Operations Center as the frontline for customer infrastructure reliability, monitoring AI compute environments at TensorWave. You'll detect issues before they impact workloads and serve as the L1 response to customer-reported problems. This role offers the chance to build the Operations Center from the ground up and directly protect critical customer workloads.
🎯 What You'll Do
- Monitor customer environments in real time using observability platforms
- Perform initial triage and classification of alerts
- Execute runbooks to diagnose and resolve infrastructure issues
- Coordinate with on-site teams and escalate to L2 engineering
📋 Requirements
- 1-3 years of experience in a NOC or similar operations role
- Experience with observability tools like Grafana, Datadog, or Prometheus
- Foundational Linux systems administration skills
- Basic understanding of networking fundamentals (TCP/IP, DNS, VLANs)
✨ Nice to Have
- Experience with GPU infrastructure or AI/ML compute environments
- Familiarity with Kubernetes container troubleshooting
- Scripting ability in Python or Bash
🎁 Benefits & Perks
- 📈 Stock Options
- 🏥 100% paid Medical, Dental, and Vision for employees
- 💰 Company HSA Contributions
- 🏖️ Flexible PTO
- 👶 Parental Leave