We are looking for an experienced Senior DevOps Engineer to join our cloud operations organization. In this role, you will work within a 100% Google Cloud Platform (GCP) environment, collaborating closely with our team of Database Administrators, as well as the streaming (Kafka) and OS support groups. You will be responsible for automating infrastructure, maintaining system reliability, and bridging the gap between operations and database management through effective scripting and tooling.
Responsibilities:
- Design, deploy, and manage scalable infrastructure on Google Cloud Platform (GCP).
- Collaborate closely with the Database Administration team (5+ DBAs) to ensure high availability and performance of data systems.
- Automate operational workflows and maintenance tasks using Python (primary team standard) or Shell/Bash.
- Partner with the streaming/OS group to support Kafka clusters and underlying operating systems.
- Implement and maintain CI/CD pipelines for seamless deployment.
- Monitor system health, troubleshoot complex infrastructure issues, and ensure rapid incident resolution.
- Document infrastructure designs, automation scripts, and operational procedures.
- Follow change management processes for critical production environments.