Lead Consultant - Data Engineer

Genpact

5 - 10 years

Bengaluru

Posted: 17/12/2024

Job Description

Responsibilities

Overall experience on Apache Spark, SQL, DBT, Databricks, Hive, HDFS, Airflow, Kafka required.

Develop SQL based data processing pipelines which includes, but not limited, SQL

functions, tables, and views.

Implement defined technology best practices and guidelines.

Create data validation rules on source data to confirm the data has correct and/or

expected values.

Perform data profiling of source data to identify data quality issues and anomalies,

business knowledge embedded in data, gathering of natural keys, and metadata

information.

Leverage Databricks to optimize and accelerate processing and analytics tasks.

Collaborate with Databricks Administrators and Platform engineers to ensure optimal

performance of data processing pipelines executed on Databricks.

Implement data quality rules and data governance policies to ensure data accuracy and

consistency.

About Company

Genpact is a global professional services firm that offers a wide range of digital transformation services and solutions. With a presence in over 30 countries, Genpact leverages its deep domain expertise in operations and analytics to help businesses transform their operations, improve efficiency, and enhance customer experience. The company combines digital technology, data science, and operational excellence to deliver business outcomes across various industries, including banking, insurance, manufacturing, and healthcare. Founded in 1997 as a subsidiary of GE, Genpact has grown into an independent, NYSE-listed company with a diverse workforce of over 90,000 employees globally.

Services you might be interested in

One-Shot Campaign

Reach out to ideal employees in one shot!

The intelligent campaign for reaching out to the ideal audience to whom you can ask for help (guidance or referral).