4+ years of experience in machine learning/data science and Python. 4+ years of experience in NLP/GenAI/LLMs. Reports to the CES Innovation Lead and works under the supervision of the CES AI Science Lead.
• Work with ESG stakeholders to understand business problems and translate them into tractable data science problems.
• Audit the text data assets of the CES department and determine how to analyze them for insights.
• Prepare high-quality training data with appropriate coverage of the ESG business domain.
• Apply and implement the latest natural language processing (NLP) research and approaches to solve business problems.
• Apply large language models (both open-source and managed versions) to build robust multi-agent retrieval-augmented generation (RAG) pipelines.
• Stay current with the latest research advancements in the field and apply innovative solutions to complex language processing challenges.
Required Skills (Mandatory):
• At least four years of work experience in machine learning and generative AI (GenAI), with experience in at least two of the following machine learning frameworks and tools: scikit-learn, TensorFlow, MLflow, PyTorch, etc.
• At least four years of experience in NLP and large language models (LLMs), with a comprehensive understanding of the underlying theories and principles, including expertise in areas such as tokenization, embedding techniques, sequence-to-sequence models, attention mechanisms, and transformer architectures.
• In-depth understanding of text analytics and NLP concepts such as lemmatization, word segmentation, part-of-speech tagging, stemming, named-entity recognition, Word2Vec, Doc2Vec, etc.
• Proficient in machine translation and optical character recognition (OCR) for complex document processing (PDFs, Word documents, scanned documents).
• Expertise in Python, with strong object-oriented design and programming skills and familiarity with CI/CD concepts.
• Proficient with parallel processing frameworks such as Apache Spark, including its Python API, PySpark.
• Excellent problem-solving and analytical skills.