We are on the hunt for a dynamic and proficient Cloud Data Engineer to join our Guardant Data Platform within the Data Team.
Duties and Responsibilities
- Quickly learn and adapt to new technologies as the Data Team's technology stack evolves,demonstrating the ability to tackle new challenges.
- Consider all aspects of usability, scalability, deployment, integration, maintenance, andautomation when integrating new technology stacks.
- Demonstrate strong programming skills in at least one language (Python, Scala, Java) and theability to learn additional languages as needed.
- Build and maintain ETL pipelines and data-driven systems utilizing technologies such asApache Spark, AWS Glue, Athena, Redshift, and AWS Batch.
- Expertise in writing complex SQL queries is essential.
- Manage code on GitHub, with a comprehensive understanding of advanced git operations,including git-flow, rebasing, and squashing.
- Implement infrastructure as code using Terraform and utilize AWS Analytics and DataServices like Glue, S3, Lambda, AWS Batch, Athena, Redshift, DynamoDB, CloudWatch,Kinesis, SQS, SNS, and DMS.
- Use Jenkins to implement deployment pipelines and engage in requirements gathering toestimate efforts for integrating new technology stacks.
- Design and architect solutions for ML, Data Governance, Deployment/IntegrationAutomations, and Data Analytics.
- Explore and learn additional AWS services such as ECS, ECR, and EC2, along with Data Modeling.