1h ago

HPC Operations Engineer

Chicago
full-timemidfinance

Tech Stack

Description

You will provide front-line operational support for a 24/7 Linux HPC environment, solving problems from research teams, responding to alerts, and participating in maintenance operations. You'll write code for diagnosing issues and automating tasks, managing vendor relationships, and developing monitoring tools in a fast-paced trading firm.

Requirements

  • 2+ years professional experience with Linux systems
  • 2+ years experience with HPC (parallel filesystems, batch systems, network interconnects)
  • High proficiency with at least one programming/scripting language (Go, Python, or C)
  • Strong verbal and written communication skills
  • Willingness to work evenings/weekends for maintenance and on-call rotation

Responsibilities

  • Provide front-line operational support for 24/7 Linux HPC compute, storage, and interconnects
  • Solve problem reports and questions from research community, manage entire problem lifecycle
  • Respond to alerts in timely fashion
  • Participate in large, coordinated maintenance operations, including evenings and weekends
  • Write code for diagnosing, resolving, and triaging problems and automating tasks
0 views 0 saves 0 applications