Member of Technical Staff - Observability

Palo Alto, CA

$180k-$440k / year

Full-time · Artificial Intelligence

💼 About This Role

You'll build and maintain the observability platform powering metrics, logs, tracing, and alerting at xAI. Your work will enable engineering teams to operate services at scale and identify issues before they impact users. This role offers the chance to handle telemetry at massive scale with strict performance requirements.

🎯 What You'll Do

  • Design and implement scalable observability infrastructure for metrics, logging, and tracing.
  • Build high-performance telemetry pipelines that handle massive ingestion volumes.
  • Develop APIs, query engines, and UIs for real-time service insights.
  • Define and enforce best practices for instrumentation and alerting.

📋 Requirements

  • Production-level proficiency in Go, Rust, Scala, or similar languages
  • Deep understanding of distributed systems and telemetry architecture
  • Experience building and operating infrastructure at scale
  • Familiarity with observability stacks such as Prometheus, Grafana, OpenTelemetry, VictoriaMetrics, or ClickHouse

✨ Nice to Have

  • Experience with Kafka, Redis, or large-scale time series databases
  • Experience operating observability pipelines in Kubernetes or similar orchestration environments

🎁 Benefits & Perks

  • 💰 Equity included in total rewards
  • 🩺 Comprehensive medical, vision, and dental coverage
  • 🏦 401(k) retirement plan
  • 🛡️ Short- and long-term disability insurance
  • 📋 Life insurance and various discounts/perks