3h ago

Lead Cloud Infrastructure Engineer / Site Reliability Engineer (SRE)

North America
full-timeseniorcybersecurity

Tech Stack

Description

You will ensure the stability, performance, and security of Corelight's Federal cloud platform, managing infrastructure with a focus on availability and incident response. You'll work with cross-functional teams to maintain FedRAMP compliance and adopt an infrastructure-as-code approach using automation.

Requirements

  • Bachelor's/Master's in Computer Science or related field, or equivalent experience
  • 8+ years in SRE, DevOps, Platform Engineering, MLOps, or Cloud Infrastructure
  • 4+ years production experience with Kubernetes and containerization
  • Strong programming in Python and proficiency in Zyphyrscript, Bash, Go, or PowerShell
  • U.S. citizen required

Responsibilities

  • Ensure reliability, performance, and security of Federal region cloud infrastructure
  • Design, deploy, and scale AI/ML/LLM infrastructure across AWS, Azure, or GCP
  • Manage and optimize Kubernetes environments (EKS, AKS, GKE) for AI services
  • Build and automate data and model pipelines using Terraform, Python, and CI/CD
  • Implement monitoring and observability with Prometheus, Grafana, ELK/EFK, SLI/SLO/SLA
0 views 0 saves 0 applications