14h ago

Site Reliability Engineer

Emeryville, CA

$100k-$300k / year

full-time · ai-ml · Visa Sponsor

💼 About This Role

You'll own the digital infrastructure powering a neuroscience-informed AGI research program. You'll ensure reliable access to compute resources, visibility into cluster health, and auto-scaling. This role offers the chance to work on ambitious goals at a foundation with a high-velocity startup culture.

🎯 What You'll Do

  • Manage compute access for researchers
  • Monitor resource utilization and cluster health
  • Implement auto-scaling of compute resources
  • Automate operational processes for efficiency

📋 Requirements

  • Ownership of and accountability for cluster health
  • Systems intuition across schedulers, containers, networking, storage
  • Operational rigor in observability and reproducibility
  • Pragmatism supporting experimental research workloads

🎁 Benefits & Perks

  • 🏖️ Unlimited PTO
  • 💰 Competitive salary
  • 🏢 In-person collaboration in Emeryville, CA
  • 🛂 Visa sponsorship available
  • 🚀 High-velocity startup culture