9h ago

Software Engineer, Cloud Infrastructure

Redwood City, CA

$180k-$300k / year

full-timeai-ml

🛠 Tech Stack

💼 About This Role

You'll lead the design and operation of highly available cloud infrastructure for AI training pipelines at DatologyAI, an AI data curation startup backed by top investors. You'll have deep impact as an early engineering hire, shaping infrastructure strategy and culture.

🎯 What You'll Do

  • Architect and maintain multi-cloud infrastructure across AWS and other providers
  • Define and implement infrastructure-as-code using Terraform or CloudFormation
  • Design and manage Kubernetes clusters for ML training and inference
  • Build monitoring, alerting, and logging systems for high availability

📋 Requirements

  • 5+ years of experience building cloud infrastructure
  • Deep experience with AWS and containerized architectures
  • Strong expertise in Kubernetes and Terraform
  • Proficiency in Bash, Python, or Go scripting

✨ Nice to Have

  • Experience supporting ML workloads (GPU orchestration, training pipelines)
  • Exposure to multi-cloud or hybrid-cloud setups
  • Background in cost optimization tools for cloud

🎁 Benefits & Perks

  • 🏥 100% covered health benefits (medical, vision, dental)
  • 💰 401(k) with 4% company match
  • 🏖️ Unlimited PTO
  • 💪 $2,000 annual wellness stipend
  • 📚 $1,000 annual learning stipend
0 0 0