Role Description:
The R&D Data Catalyst Team is responsible for buildingData Searching, Cohort Building, and Knowledge Management tools that provide the Amgen scientific community with visibility to Amgenâs wealth of human datasets, projects and study histories, and knowledge over various scientific findings.     These solutions are pivotal tools in Amgenâs goal to accelerate the speed of discovery, and speed to market of advanced precision medications. Â
The Sr. Data Engineer will be responsible for the end-to-end development of an enterprise analytics and data mastering solution leveraging Databricks and Power BI. This role requiresexpertise in both data architecture and analytics, with the ability to create scalable, reliable, and high-performing enterprise solutions that research cohort-building and advanced research pipeline.The ideal candidate will have experience creating and surfacing large unifiedrepositories of human data, based on integrations from multiple repositories and solutions, and be exceptionally skilled with data analysis and profiling. Â
You will collaborate closely with stakeholders, product team members, and related IT teams, to design and implement data models, integrate data from various sources, and ensure best practices for data governance and security. The ideal candidate will have a strong background in data warehousing, ETL, Databricks, Power BI, and enterprise data mastering.
Roles & Responsibilities:
Design and build scalable enterprise analytics solutions using Databricks, Power BI, and other modern data tools.
Leverage data virtualization, ETL, and semantic layers to balance need for unification, performance, and data transformation with goal to reduce data proliferation
Break down features into work that aligns with the architectural direction runway
Participate hands-on in pilots and proofs-of-concept for new patterns
Create robust documentation from data analysis and profiling, and proposed designs and data logicÂ
Develop advanced sql queries to profile, and unify dataÂ
Develop data processing code in sql, along with semantic views to prepare data for reportingÂ
Develop PowerBI Models and reporting packages
Design robust data models, and processing layers, that support both analytical processing and operational reporting needs.
Design and develop solutions based on best practices for data governance, security, and compliance within Databricks and Power BI environments.
Ensure the integration of data systems with other enterprise applications, creating seamless data flows across platforms.
Develop and maintain Power BI solutions, ensuring data models and reports are optimized for performance and scalability.
Collaborate with stakeholders to define data requirements, functional specifications, and project goals.
Continuously evaluate and adopt new technologies and methodologies to enhance the architecture and performance of data solutions.
Basic Qualifications and Experience:
Masterâs degree with 4 to 6 years of experience in Product Owner / Platform Owner / Service Owner OR
Bachelorâs degree with 8 to 10 years of experience in Product Owner / Platform Owner / Service OwnerÂ
Functional Skills:
Must-Have Skills:
Minimum of 3 years of hands-on experience with BI solutions (Preferrable Power BI or Business Objects) including report development, dashboard creation, and optimization.
Minimum of 6 years of hands-on experience building Change-data-capture (CDC) ETL pipelines, data warehouse design and build, and enterprise-level data management.
Hands-on experience with Databricks, including data engineering, optimization, and analytics workloads.
Deep understanding of Power BI, including model design, DAX, and Power Query.
Proven experience designing and implementing data mastering solutions and data governance frameworks.
Expertise in cloud platforms (AWS), data lakes, and data warehouses.
Strong knowledge of ETL processes, data pipelines, and integration technologies.
Strong communication and collaboration skills to work with cross-functional teams and senior leadership.
Ability to assess business needs and design solutions that align with organizational goals.
Exceptional hands-on capabilities with data profiling, data transformation, data masteringÂ
Success in mentoring and training team members
Good-to-Have Skills:
Experience in developing differentiated and deliverable solutions
Experience with human data, ideally human healthcare data
Familiarity with laboratory testing, patient data from clinical care, HL7, FHIR, and/or clinical trial data management
Professional Certifications (please mention if the certification is preferred or mandatory for the role):
ITIL Foundation or other relevant certifications (preferred)
SAFe Agile Practitioner (6.0)
Microsoft Certified: Data Analyst Associate (Power BI) or related certification.
Databricks Certified Professional or similar certification.
Soft Skills:
Excellent analytical and troubleshooting skills
Deep intellectual curiosity
Highest degree of initiative and self-motivation
Strong verbal and written comm
amgen