3h ago
Lead Cloud Infrastructure Engineer / Site Reliability Engineer (SRE)
North America
full-timeseniorcybersecurity
Tech Stack
Description
You will ensure the stability, performance, and security of Corelight's Federal cloud platform, managing infrastructure with a focus on availability and incident response. You'll work with cross-functional teams to maintain FedRAMP compliance and adopt an infrastructure-as-code approach using automation.
Requirements
- Bachelor's/Master's in Computer Science or related field, or equivalent experience
- 8+ years in SRE, DevOps, Platform Engineering, MLOps, or Cloud Infrastructure
- 4+ years production experience with Kubernetes and containerization
- Strong programming in Python and proficiency in Zyphyrscript, Bash, Go, or PowerShell
- U.S. citizen required
Responsibilities
- Ensure reliability, performance, and security of Federal region cloud infrastructure
- Design, deploy, and scale AI/ML/LLM infrastructure across AWS, Azure, or GCP
- Manage and optimize Kubernetes environments (EKS, AKS, GKE) for AI services
- Build and automate data and model pipelines using Terraform, Python, and CI/CD
- Implement monitoring and observability with Prometheus, Grafana, ELK/EFK, SLI/SLO/SLA
0 views 0 saves 0 applications