5 days ago
MLOps / Infrastructure Engineer
New York City
$130,000-$230,000 / year
full-time, senior, Remote, Artificial Intelligence
Description
You'll design and maintain cloud infrastructure to support real-time ML model serving, data ingestion, and evaluation workflows. You'll deploy and optimize APIs for low-latency access to ML models and embedding search systems, working closely with ML engineers and researchers to build infrastructure that makes high-performance models accessible and reliable in production.
Requirements
- 3–8 years of experience deploying machine learning systems or high-availability backend systems
- Has shipped and maintained production infrastructure at scale, supporting ML workflows
- Experience with GCP, AWS, or similar platforms (including managed ML services)
- Proficient in Terraform, Docker, Kubernetes, or similar infra tools
- Understands performance tradeoffs in serving models and embedding search pipelines
- Can work cross-functionally with ML, security, and product teams to deploy safely and iterate fast
- Brings a builder's mindset and bias for ownership in ambiguous environments
Responsibilities
- Design and maintain cloud infrastructure (GCP or AWS) to support real-time model serving, data ingestion, and evaluation workflows
- Deploy and optimize APIs for low-latency access to ML models and embedding search systems
- Manage and optimize the end-to-end training data flow, from sourcing and cleaning datasets to preparing them for model consumption, with a focus on accuracy, scalability, and efficiency
- Build observability tooling for production ML pipelines that monitors latency, error rates, request volumes, and drift
- Automate model deployment, retraining, and evaluation pipelines (CI/CD for ML)
- Work with ML engineers to package models for serving
- Help manage vector databases and semantic search infrastructure (e.g., Pinecone, FAISS, Vertex Matching Engine)
- Ensure security, compliance, and uptime of infrastructure supporting safety-critical systems