Site Reliability Engineering:
The Site Reliability Engineering (SRE) team is responsible for ensuring the reliability, availability, and performance of Sitecore鈥檚 cloud-native products. It is a team of engineers with a mission to make reliability a first-class feature of our platforms, enabling Sitecore to deliver trusted, scalable digital experiences to millions of end users worldwide while supporting rapid product evolution.
The team works closely with product and platform engineering teams to define, measure, and improve service reliability through practices such as service level indicators and objectives (SLIs/SLOs), incident management, automation, and toil reduction. The SRE team focuses on building strong observability foundations, improving operational readiness, and driving a reliability-first engineering culture across Sitecore. By combining deep systems expertise with strong collaboration, the team helps ensure that Sitecore services remain highly available, resilient, and performant as they continue to grow and evolve.
About the Role:
We are seeking an experienced engineering manager who excels in operational strategy to scale our technology, processes and team effectiveness. In this role, you will focus on establishing robust operational processes, minimizing downtime, increasing reliability and streamlining delivery pipelines. Leading a skilled technical team, your priorities will be to optimize processes, establish clear operational metrics, and foster collaboration across R&D teams to enable seamless and reliable product delivery.
What You鈥檒l Do: