HPC Operations Engineer
Chicago
full-time, mid, finance
Description
You will provide front-line operational support for a 24/7 Linux HPC environment: resolving problems reported by research teams, responding to alerts, and participating in maintenance operations. You'll also write code to diagnose issues and automate tasks, manage vendor relationships, and develop monitoring tools at a fast-paced trading firm.
Requirements
- 2+ years of professional experience with Linux systems
- 2+ years of experience with HPC (parallel filesystems, batch systems, network interconnects)
- High proficiency with at least one programming/scripting language (Go, Python, or C)
- Strong verbal and written communication skills
- Willingness to work evenings/weekends for maintenance and on-call rotation
Responsibilities
- Provide front-line operational support for 24/7 Linux HPC compute, storage, and interconnects
- Resolve problem reports and answer questions from the research community, managing the entire problem lifecycle
- Respond to alerts in a timely fashion
- Participate in large, coordinated maintenance operations, including evenings and weekends
- Write code for triaging, diagnosing, and resolving problems and for automating tasks