Software Engineer, Productivity - Inference Runtime at OpenAI — CareerPair

13h ago

Software Engineer, Productivity - Inference Runtime

San Francisco, CA

$230k-$385k / year

full-time · senior · ai-ml

💼 About This Role

You'll build tooling and infrastructure for deploy gate validation and release engineering on OpenAI's inference runtime teams. Your work ensures model launches are safe, performant, and regression-free. This is high-impact infrastructure for one of the largest inference platforms in the world.

🎯 What You'll Do

Improve deploy gate validation tooling and infrastructure
Harden CI and testing infrastructure for reliability
Build automation for failure triage and escalation
Reduce developer friction in testing and release workflows

📋 Requirements

Experience with CI/CD systems and release tooling
Python proficiency for deployment infrastructure
Ability to debug complex distributed systems
Strong developer empathy and ownership mindset

✨ Nice to Have

C++ experience working in or near inference engine code
Prior inference system knowledge

🎁 Benefits & Perks

💰 Competitive salary range $230K-$385K
🏖️ Flexible PTO
🏥 Health insurance (medical, dental, vision)
📈 Equity package
🤖 Access to latest AI tools

📨 Hiring Process

Estimated timeline: 2-4 weeks · AI estimate

1. Recruiter Screen · 30 min
2. Technical Phone Interview · 60 min
3. Onsite Final Rounds · Half day
