The Data Reliability Engineer supports the design, implementation, and ongoing improvement of cloud-native, containerized infrastructure that powers our data products and services. This role contributes to the reliability, scalability, security, and operational health of our data ecosystem while working closely with more experienced engineers and cross-functional teams.
In this role, you will help maintain and enhance platforms and services used across the Data organization. You will participate in day-to-day engineering efforts, support production operations, assist with automation and infrastructure improvements, and contribute to the successful delivery of data platform capabilities.
You will work with a modern data stack that includes Data Services running on EKS, such as Superset, Trino, Apache Doris ClickHouse, OpenMetadata and other platform components, as well as Databricks (Spark jobs), Airflow, and other Big Data technologies. The team is also expanding into new areas, including helping deploy and implement AI agents, providing opportunities to contribute to innovative solutions at the intersection of data and AI.
This is a strong opportunity for an engineer with a solid technical foundation who is eager to grow skills in cloud infrastructure, platform engineering, data systems, and reliability practices in a collaborative environment
This is a hybrid position. Expectation of days in office will be confirmed by your hiring manager.