Staff Site Reliability Engineer - Splunk Expert

Bengaluru, India
Full-time, Senior, Hybrid, Identity and Access Management

Description

You will architect and evolve a comprehensive observability platform, moving beyond simple monitoring. As the Splunk subject-matter expert, you will optimize logging performance and cost, treat infrastructure as code using Terraform and Go/Python/Ruby, and build automated workflows to reduce incident resolution time.

Requirements

  • 8+ years in SRE, DevOps, or Systems Engineering with high-availability systems.
  • Deep hands-on Splunk administration and search optimization (SPL) experience.
  • Proven ability to build actionable Grafana dashboards.
  • Strong coding skills in Go, Python, or Ruby for automation.
  • Experience with OpenTelemetry, Prometheus, Linux internals, and Kubernetes/EKS.

Responsibilities

  • Lead Splunk environment design and tuning for indexer performance, search efficiency, and cost optimization.
  • Architect and maintain sophisticated Grafana dashboards correlating disparate data sources.
  • Design and build scalable observability infrastructure using Terraform.
  • Optimize telemetry data collection, processing, and storage (Metrics, Logs, Traces).
  • Develop custom Splunk workflows and integrations for automated incident response.