The Senior SRE will ensure the reliability, performance, and scalability of mission‑critical systems. You’ll combine software engineering and operations expertise to improve observability, automate operations, and reduce downtime.
Build and maintain monitoring, alerting, and observability systems
Improve system reliability through automation and performance tuning
Drive incident response, root‑cause analysis, and post‑mortems
Collaborate with Platform, Automation, and Security teams
Develop tools to reduce manual operational work
Ensure SLAs, SLOs, and SLIs are defined and met
Strong SRE or DevOps background in cloud environments
Experience with monitoring tools (Prometheus, Grafana, Azure Monitor, etc.)
Proficiency in scripting and automation
Strong understanding of distributed systems and reliability engineering
Dutch‑speaking
Hybrid: 1–2 days per week in Amstelveen