6h ago
Software Engineer, Agent Infrastructure
San Francisco, CA; New York City, NY
$230k-$385k / year
full-time Hybridai-ml
🛠 Tech Stack
💼 About This Role
You'll build and scale systems for training and deploying AI agents, working closely with research and product teams. Your work will push massive compute clusters to their limits and power products like Codex and Operator. This role offers the chance to impact agentic AI infrastructure at the forefront of AI research.
🎯 What You'll Do
- Push massive compute clusters to their limits with novel container orchestration.
- Develop and maintain FastAPI and gRPC APIs for agentic infrastructure.
- Use Terraform to stand up and evolve complex infrastructure.
- Collaborate with research teams to optimize training systems.
📋 Requirements
- Deep experience with large-scale machine learning infrastructure.
- Ability to build from 0-1 and scale 1,000,000x.
- Keen eye for performance optimization in distributed systems.
- Expertise with cloud platforms and infrastructure-as-code like Terraform.
✨ Nice to Have
- Expertise in virtualization and containerization (e.g., Kata, Firecracker, gVisor, Sysbox).
🎁 Benefits & Perks
- 💰 Competitive compensation with equity
- 🏖️ Flexible time off
- 🏠 Hybrid work with relocation assistance
0 0 0