
Engineer_Data Engineer

Arrow

2 - 5 years

Pune

Posted: 13/09/2025


Job Description

Position:

Engineer_Data Engineer

Job Description:

Job Title: Data Engineer

Role Summary

Build and operate scalable, reliable data pipelines on Azure. Develop batch and streaming ingestion, transform data using Databricks (PySpark/SQL), enforce data quality, and publish curated datasets for analytics and ML.

Key Responsibilities

· Design, build, and maintain ETL/ELT pipelines in Azure Data Factory and Databricks across Bronze → Silver → Gold layers.
· Implement Delta Lake best practices (ACID, schema evolution, MERGE/upsert, time travel, Z-ORDER).
· Write performant PySpark and SQL; tune jobs (partitioning, caching, join strategies).
· Create reusable components; manage code in Git; contribute to CI/CD pipelines (Azure DevOps/GitHub Actions/Jenkins).
· Apply data quality checks (Great Expectations or custom validations), monitoring, drift detection, and alerting.
· Model data for analytics (star/dimensional); publish to Synapse/Snowflake/SQL Server.
· Uphold governance and security (Purview/Unity Catalog lineage, RBAC, tagging, encryption, PII handling).
· Author documentation/runbooks; support production incidents and root-cause analysis; suggest cost/performance improvements.

Must-Have (Mandatory)

· Data Engineering & Pipelines
  o Hands-on experience building production pipelines with Azure Data Factory and Databricks (PySpark/SQL).
  o Working knowledge of Medallion Architecture and Delta Lake (schema evolution, ACID).
· Programming & Automation
  o Strong Python (pandas/PySpark) and SQL.
  o Practical Git workflow; experience integrating pipelines into CI/CD (Azure DevOps/GitHub Actions/Jenkins).
  o Familiarity with packaging reusable code (e.g., Python wheels) and configuration-driven jobs.
· Data Modeling & Warehousing
  o Solid grasp of dimensional modeling/star schemas; experience with Synapse, Snowflake, or SQL Server.
· Data Quality & Monitoring
  o Implemented validation checks and alerts; exposure to drift detection and pipeline observability.
· Cloud Platforms (Azure preferred)
  o ADLS Gen2, Key Vault, Databricks, ADF basics (linked services, datasets, triggers), environment promotion.
· Data Governance & Security
  o Experience with metadata/lineage (Purview/Unity Catalog), RBAC, secrets management, and secure data sharing.
  o Understanding of PII/PHI handling and encryption at rest/in transit.
· Collaboration

Candidate Roles and Responsibilities

 

 

Technical Skills required:

Must have Skills:

Skills Beginner Intermediate Expert

Azure Data Factory and Databricks (PySpark/SQL) Yes

Python (pandas/PySpark) Yes

SQL Yes

CI/CD (Azure DevOps/GitHub Actions/Jenkins) Yes

Exposure to drift detection and pipeline observability Yes

Cloud platform [Azure preferred: ADLS Gen2, Key Vault, Databricks, ADF basics] Yes

Data Governance & Security Yes

Good to have skills:

Skills Beginner Intermediate Expert

Databricks Asset Bundles (DAB) Yes

  o Clear communication, documentation discipline, Agile ways of working, and code reviews.

Good-to-Have

  o Databricks Asset Bundles (DAB) for environment promotion/infra-as-code style deployments.
  o Streaming/real-time: Kafka/Event Hubs; CDC tools (e.g., Debezium, ADF/Synapse CDC).
  o MLOps touchpoints: MLflow tracking/registry, feature tables, basic model-inference pipelines.
  o Power BI exposure for publishing curated tables and building operational KPIs.
  o DataOps practices: automated testing, data contracts, lineage-aware deployments, cost optimization on Azure.
  o Certifications: Microsoft Certified: Azure Data Engineer Associate (DP-203) or equivalent.

Qualifications

· 4–6 years of professional experience in data engineering (or equivalent project depth).
· Bachelor’s/Master’s in CS/IT/Engineering or related field (or equivalent practical experience).

Location:

IN-MH-Pune, India-Baner (eInfochips)

Time Type:

Full time

Job Category:

Engineering Services

About Company

Arrow Electronics is a Fortune 500 technology company that specializes in providing electronic components and enterprise IT solutions. Headquartered in Centennial, Colorado, it supports over 220,000 customers across 80+ countries. Arrow helps businesses design, build, and manage innovative technology products through its global distribution, engineering, and supply chain services.
