We are looking for a skilled and motivated Lead Engineer to join our Data Science and Delivery group at Clario, a part of Thermo Fisher Scientific. This role combines software development, data engineering, and analytical problem-solving to design, build, and maintain scalable data platforms that support clinical trial operations and business intelligence. You will work across the full software development lifecycle (SDLC), from requirements gathering through production support, collaborating closely with data scientists, analysts, product managers, and engineering teams to deliver high-quality, data-driven solutions.
What We Offer
Competitive compensation aligned with local market practices
Comprehensive health and wellness benefits
Paid time off and company holidays
Opportunities for professional development, learning, and career growth
The flexibility of working from Bangalore or remotely within India, while collaborating with global teams
What You'll Be Doing
Design, develop, and maintain scalable software architectures and data pipelines that integrate with analytical and operational systems.
Write clean, reusable, and well-tested Python code using frameworks such as Flask and related libraries.
Leverage AI-assisted development tools, including GitHub Copilot and LangChain, to design, build, and integrate LLM-powered solutions such as retrieval-augmented generation (RAG) pipelines, intelligent agents, and automated workflows using AWS Bedrock or similar services.
Develop and optimize complex SQL across Oracle, MS SQL Server, PostgreSQL, and Snowflake, including procedures, functions, views, analytical functions, and dynamic SQL.
Design and implement ETL pipelines using Snowflake and related data processing technologies.
Implement scheduling and orchestration using Apache Airflow or similar workflow orchestration frameworks.
Establish and maintain data quality frameworks, versioning, and governance practices to ensure data reliability, integrity, and compliance.
Develop and maintain data architectures and models for both structured and unstructured data sources.
Troubleshoot production issues and drive continuous improvement in software quality, performance, and reliability.
Deploy, manage, and support solutions on AWS, including storage, compute, and pipeline services.
Create source-to-target mappings and support data and code migration initiatives.
Partner with stakeholders to gather requirements, translate business needs into technical solutions, and produce clear, well-structured documentation.
Collaborate with product managers, analysts, and cross-functional teams to deliver data-driven insights and reporting using tools such as Plotly and Power BI.
What We Look For
Bachelor's or higher degree in Computer Science, Information Technology, or a related technical field.
5+ years of professional experience in software engineering, data engineering, or data-focused development roles.
Strong proficiency in Python, including frameworks and libraries such as Django or Flask, pandas, NumPy, Plotly, and ag-Grid.
Strong SQL expertise with Oracle, MS SQL Server, PostgreSQL, and/or Snowflake.
Proven experience writing complex SQL, including analytical and window functions, subqueries, all join types, DML/DDL/TCL statements, CASE expressions, and performance tuning.
Working knowledge of cloud platforms, with a preference for AWS (S3, EC2, Secrets Manager, Bedrock, Lambda).
Experience using AI-assisted development tools and frameworks such as GitHub Copilot and LangChain for building LLM-powered applications and workflows.
Experience with Git-based version control systems and CI/CD pipelines.
Familiarity with data modeling concepts for both structured and unstructured data.
Strong analytical thinking, problem-solving abilities, and communication skills.
Willingness to work across all phases of the SDLC, including requirements gathering, design, development, deployment, and production support.
Preferred experience includes exposure to the clinical trial lifecycle or clinical data management, data visualization tools (Plotly, Power BI), front-end technologies (HTML5, CSS3, JavaScript), collaboration tools (Jira, Confluence, Microsoft Teams), and hands-on data analysis or data cleansing using programming languages, SQL, and Excel.
At Clario, our purpose is to transform lives by unlocking better evidence. It's a cause that unites and inspires us. It's why we come to work, and how we empower our people to make a positive impact every day. Whether you're starting your clinical data career or building long-term expertise, your work helps bring life-changing therapies to patients faster.