Role Summary
Syngenta is looking for a proactive and driven Data Architect to join our cloud and Data Ops team. In this role, you will work on designing the system architecture and solution, ensuring the platform is scalable while performant, and creating automated data pipelines.
Responsibilities:
- Design and lead implementation of end-to-end Databricks Lakehouse Platforms using Delta Lake, Delta Live Tables, and MLflow.
- Architect Medallion Architecture (Bronze/Silver/Gold) for structured, semi-structured, and streaming workloads.
- Implement governed Lakehouse patterns using Unity Catalog for access control, lineage, data classification, and secure sharing.
- Build scalable ETL/ELT pipelines using Databricks Notebooks, Workflows, SQL Warehouses, and Spark-based transformations.
- Develop real-time streaming pipelines with Auto Loader, Structured Streaming, and event-driven platforms (Kafka, Kinesis, Pub/Sub).
- Integrate Databricks with cloud-native services such as AWS Glue, Azure Data Factory, and GCP Dataform.
- Define distributed integration patterns using REST APIs, microservices, and event-driven architectures.
- Enforce data governance, RBAC/ABAC, encryption, secret management, and compliance controls.
- Optimize Delta Lake tables, Spark workloads, and cluster configurations using Photon and autoscaling patterns.
- Drive cloud cost optimization across storage, compute, and workflow orchestration.
- Participate in architecture reviews, set standards, and support engineering teams throughout execution.
- Stay current on Databricks capabilities including Unity Catalog updates, Lakehouse Federation, serverless compute, and AI/ML features.