Data Engineer โ AI & Digital Platforms
Must-Have Skills
- Hadoop and MapReduce
- Cloudera
- AI-enabled Application Development
- Machine Learning โ General Experience
- LLM Application Frameworks (Capable)
Key Responsibilities
Full Stack Development
- Proficiency in Python, Shell scripting, REST APIs, and web frameworks (Flask, React).
Machine Learning & AI
- Hands-on experience with ML platforms (CML), Spark MLlib, and Python ML libraries (scikit-learn, XGBoost).
- Experience in operationalizing ML models at enterprise scale.
GenAI/LLM Applications
- Familiarity with building applications using large language models (OpenAI, Hugging Face, LangChain).
- Ability to build agent workflows and support users in creating agent-based solutions.
Security & Governance
- Experience with enterprise data security (LDAP, Kerberos, RBAC), data masking, and access control.
Performance Tuning
- Strong expertise in optimizing data applications and queries in Hadoop and Teradata environments.
Tools & Platforms
- Cloudera Data Platform (CDP), Informatica, QlikSense, Apache Oozie, Git, CI/CD pipelines.
Soft Skills
- Strong analytical and problem-solving skills.
- Excellent communication abilities.
- Ability to work effectively in cross-functional teams