We are building a nextâgeneration, businessâcentric data intelligence and AI foundation that fuels Financeâfrom FP&A intelligence to product, compliance, and controllership decisionâmaking. As a Data Engineer of the Finance Technology â Data Intelligence organization, you will architect scalable data pipelines, semantic models, and endâtoâend BI solutions, while safely operationalizing GenAI capabilities such as RAG, prompt engineering, evaluation frameworks, and agentâbased workflows. You will collaborate closely with analysts, data scientists, and engineering partners to deliver secure, reliable, auditable, and reusable data and AI services that materially enhance decision quality, automation, and speed across Finance.
Responsibilities:
Build the data foundation
Collaborate with Data Analysts, Data Scientists, Software Engineers, and cross-functional partners to design, build, and deploy scalable data pipelines that deliver highâquality, governed analytical datasets across Finance domains.
Engineer highâquality batch/streaming data pipelines (SQL/Hive/PySpark) across Lake/Lakehouse to power curated finance domain marts and a governed semantic layer.
Design dimensional/semantic models that enable selfâservice analytics (PowerâŻBI / Fabric semantic models / SSAS Tabular) with performant DAX measures and rowâlevel security.
Operationalize GenâŻAI for Finance
Ship productionâgrade GenâŻAI features (retrievalâaugmented generation, promptâchaining/agents) on governed datasets - implement vectorization strategies and chunking that respect PII/SOX controls.
Partner with DS/ML to train/fineâtune and evaluate models - harden prompt templates, guardrails, and content filters - track hallucination, toxicity, and retrieval metrics (precision/recall, hit@k).
Build reusable components (prompt libraries, evaluation harnesses, vector store abstractions) and integration SDKs/APIs for reuse across Finance use cases.
Platform, reliability & DevOps
Implement CI/CD for data & AI (Git, Azure DevOps/GitHub Actions), data quality tests (Great Expectations or equivalent), and model/data deployment automation (MLflow/Fabric/Azure ML).
Define observability (lineage, drift, freshness, cost) with alerts/SLAs - drive continuous hardening for performance (SQL/DAX tuning), cost efficiency, and reliability.
Analytics enablement
Deliver highâimpact dashboards/scorecards (PowerâŻBI/Tableau) and governed certified datasets - coach analysts on bestâpractice modeling and performance tuning.
Risk, governance & documentation
Embed privacyâbyâdesign (PII masking, purpose limitation), finance controls (SOX, audit trails), and robust documentation (runbooks, data dictionaries, model cards).
This is a hybrid position. Expectation of days in the office will be confirmed by your Hiring Manager.