What you get to do in this role:
- Drive deep observability and service level management initiatives (SLIs/SLOs) across infrastructure and platform services.
- Use your expertise in software development, systems engineering, and networking to proactively prevent repeatable issues and eliminate toil at its source.
- Lead reliability engineering engagements with partner teams to enable them to build, maintain, and improve their products with confidence.
- Champion resilient architecture design and operationalization practices across the engineering organization.
- Drive a culture of engineering excellence by building scalable, automated solutions that embed reliability at the point of development.
- Develop and advance monitoring, automation, and reliability frameworks that raise the maturity of services organization-wide.