Principal Consultant - Data Engineer (AI/ML)
Genpact
5 - 10 years
Bengaluru
Posted: 3/5/2025
Job Description
Responsibilities:
Build Spark pipelines for extracting, transforming, and loading data from a wide variety of sources using PySpark and SQL
Schedule and merge dependent Airflow jobs
Collaborate with analytics and business teams to improve data models that feed business intelligence tools, increasing data accessibility
Implement processes and systems to monitor data quality, ensuring production data is accurate and available
Work closely with engineering teams to develop a strategy for long-term data platform architecture
Design and develop machine learning and deep learning systems.
Requirements:
Bachelor's degree in Computer Science, IT, engineering, or a related field.
Experience working with high-performance, distributed and in-memory systems
Working knowledge of PySpark and Airflow, preferably on the Databricks platform
Working knowledge of open-source Trino (Presto) or Starburst
Working knowledge of Python and other object-oriented languages
Working knowledge of SQL, including query authoring, and experience with relational databases, as well as familiarity with a variety of databases (Postgres, Oracle, Vertica, Delta Lake)
Working knowledge of Kafka, preferably the commercial version from Confluent
Working knowledge of AWS or any other cloud services (RDS, MSK, S3, etc.)
Knowledge of the Vertica columnar database
Familiarity with machine learning libraries/frameworks (Keras, PyTorch, scikit-learn), plotting libraries (Matplotlib, seaborn, Plotly), and Jupyter notebooks.
Knowledge of a variety of machine learning techniques (clustering, decision tree learning, artificial neural networks, etc.) and their real-world advantages/drawbacks.
Knowledge of Gen-AI, with success on a small-scale project.
Experience with or knowledge of Agile methodologies such as Scrum.
About Company
Genpact is a global professional services firm that offers a wide range of digital transformation services and solutions. With a presence in over 30 countries, Genpact leverages its deep domain expertise in operations and analytics to help businesses transform their operations, improve efficiency, and enhance customer experience. The company combines digital technology, data science, and operational excellence to deliver business outcomes across various industries, including banking, insurance, manufacturing, and healthcare. Founded in 1997 as a subsidiary of GE, Genpact has grown into an independent, NYSE-listed company with a diverse workforce of over 90,000 employees globally.