We are looking for an experienced Senior Systems Engineer who feels comfortable at the intersection of classical on-premises infrastructure and modern cloud-native technologies. You will be responsible for the reliability, automation and observability of our hybrid environments, with a strong focus on VMware Cloud Foundation (VCF), observability with Grafana, Kubernetes cluster management, and automation workflows.
Your main responsibilities
- Design, operate and continuously improve the reliability and availability of our platforms based on VMware Cloud Foundation (VCF), both on-premises and interconnected with cloud environments
- Manage and automate VMware landscapes (vSphere, NSX, vSAN, Aria Suite, etc.) in large-scale hybrid and multi-cloud setups
- Implement and extend our observability stack based on Grafana
- Build, operate and scale Kubernetes clusters, including day-2 operations, upgrades, capacity management and security hardening
- Develop and maintain automation workflows, primarily using Kestra in conjunction with other tools such as Ansible and Terraform
- Drive incident response, post-mortem culture, error budgets and toil reduction according to SRE principles