🔔 FCM Loaded

Python Developer

Sourcebae

2 - 5 years

Pune

Posted: 19/02/2026

Getting a referral is 5x more effective than applying directly

Job Description

Lead SDET Data Engineering / Data QA (Databricks & PySpark)


Location: Indore / Pune

Experience: 1011 Years (Relevant 57 years hands-on)

Compensation: Highly Cometitive

Employment Type: Full-Time

Joining: Immediate / Serving Notice Period Candidates Preferred


Role Overview

We are looking for a hands-on Lead SDET / Data QA Engineer with strong expertise in data validation and data pipeline testing within modern data platforms. This role is execution-focused, similar to development work, and is ideal for professionals who enjoy validating complex data transformations built by data engineers.

The role is positioned as a Tester organizationally but requires deep understanding of data engineering concepts, SQL, and PySpark-based pipelines. This is not a managerial role; however, slight team lead exposure is preferred for future scaling.


Key Responsibilities

Data Validation & Pipeline Testing

Validate large-scale data pipelines built using Azure Data Factory and Databricks

Perform deep data validation across multiple layers (raw curated consumption)

Validate data transformations, joins, aggregations, and business logic implemented by data

engineers

Perform SQL-based reconciliation, data completeness, and accuracy checks

Review data pipeline logic and collaborate with data engineers to identify defects early


Execution-Focused Testing

Write and execute PySpark / Python-based test logic for validating transformations

Perform heavy-weight testing comparable to development work (not basic QA checks)

Debug pipeline failures and analyze root causes at data and code level

Ensure data correctness across batch pipelines (streaming optional, not mandatory)


CI/CD & Platform Collaboration

Integrate testing workflows into Azure DevOps CI/CD pipelines

Validate data outputs post-deployment across environments

Work closely with data engineers, platform teams, and stakeholders to ensure release quality


Leadership (Lightweight)

Provide technical guidance to junior testers (if assigned)

Review test logic and validation queries

Help standardize data QA best practices (process can be trained, coding cannot)



Mandatory Skills & Technologies

Data Engineering Testing Experience: 57 years (hands-on)

SQL: Strong (joins, aggregations, reconciliation, validation queries)

Databricks: Hands-on experience

PySpark / Python: Strong working knowledge (writing validation logic)

Azure Data Factory (ADF): Pipeline understanding and validation

Azure DevOps: CI/CD pipeline integration

Strong understanding of data engineering concepts and pipeline architecture

Services you might be interested in

Improve Your Resume Today

Boost your chances with professional resume services!

Get expert-reviewed, ATS-optimized resumes tailored for your experience level. Start your journey now.