1h ago

Staff Platform Engineer - High Performance Computing

Singapore, Singapore

$160k-$215k / yearest.

full-timelead

🛠 Tech Stack

+1

💼 About This Role

You'll lead HPC infrastructure platform management and optimization for a dynamic team. Your core impact is designing, implementing, and maintaining resilient, scalable, and secure HPC systems including compute, storage, and networking. You'll collaborate with data scientists and developers to optimize application performance on cutting-edge HPC platforms.

🎯 What You'll Do

  • Design and manage HPC infrastructure including compute, storage, and networking.
  • Lead implementation of job scheduling systems and resource allocation.
  • Monitor and optimize HPC cluster performance and capacity.
  • Ensure security, compliance, and access controls for HPC platform.

📋 Requirements

  • 8+ years experience in managing HPC systems.
  • Strong knowledge of HPC architectures (clusters, grids, clouds).
  • Experience with job scheduling systems such as Slurm, Torque, LSF.
  • Experience with scripting languages like Python, Perl, or Bash.

✨ Nice to Have

  • Certifications in NVIDIA AI Infrastructure and Certified Kubernetes Administrator.
  • Experience with ML/DL frameworks like TensorFlow or PyTorch.
  • Familiarity with agile development and version control (Git).

🎁 Benefits & Perks

  • 🌟 Meaningful work with modern tech stacks
  • 🛠️ Excellent engineering culture and work-life balance
  • 📈 Empowerment to innovate and grow together
0 0 0