Site Reliability Engineer
Paris, France
Full-time, mid-level, hybrid
💼 About This Role
You'll join Scaleway's SRE team to build and maintain reliable, observable, and secure infrastructure for sovereign cloud services. You'll automate monitoring and incident response to ensure high availability for thousands of customers.
🎯 What You'll Do
- Build and optimize tooling for automated monitoring and incident remediation
- Troubleshoot high-impact production issues with engineering teams
- Participate in the on-call rotation to ensure service continuity
- Implement and maintain observability solutions for infrastructure health
📋 Requirements
- Experience with Go, Python, or Rust
- Strong scripting skills in Bash and Python
- Hands-on experience with Linux systems (Ubuntu/Debian)
- Knowledge of networking (TCP/IP, DNS, BGP, load-balancing, IPv6)
✨ Nice to Have
- Familiarity with monitoring tools such as Prometheus, Grafana, and Elastic
- Experience with Infrastructure-as-Code (Ansible, Salt, AWX)
- Experience managing relational databases (PostgreSQL)
🎁 Benefits & Perks
- 🏡 Hybrid work, with up to 3 remote days per week
- 🍽️ Healthy meal service at headquarters and Swile card for regional sites
- 🏋️ Well-being commitments including gym access and daycare places
- 🌍 International environment with dozens of nationalities
- 🚀 Career & mobility opportunities within Iliad Group
📨 Hiring Process
- Discovery call with a recruiter (30 min)
- Manager interview (45 min)
- Technical interview (1h)
- Head of tribe interview (45 min)
- HR interview with office tour