🔔 FCM Loaded

Engineer_Data Engineer

Arrow

2 - 5 years

Pune

Posted: 13/09/2025

Getting a referral is 5x more effective than applying directly

Job Description

Position:

Engineer_Data Engineer

Job Description:

Job Title: Data Engineer Role Summary Build and operate scalable, reliable data pipelines on Azure. Develop batch and streaming ingestion, transform data using Databricks (PySpark/SQL), enforce data quality, and publish curated datasets for analytics and ML. Key Responsibilities · Design, build, and maintain ETL/ELT pipelines in Azure Data Factory and Databricks across Bronze → Silver → Gold layers. · Implement Delta Lake best practices (ACID, schema evolution, MERGE/upsert, time travel, Z-ORDER). · Write performant PySpark and SQL; tune jobs (partitioning, caching, join strategies). · Create reusable components; manage code in Git; contribute to CI/CD pipelines (Azure DevOps/GitHub Actions/Jenkins). · Apply data quality checks (Great Expectations or custom validations), monitoring, drift detection, and alerting. · Model data for analytics (star/dimensional); publish to Synapse/Snowflake/SQL Server. · Uphold governance and security (Purview/Unity Catalog lineage, RBAC, tagging, encryption, PII handling). · Author documentation/runbooks; support production incidents and root-cause analysis; suggest cost/performance improvements. Must-Have (Mandatory) · Data Engineering & Pipelines o Hands-on experience building production pipelines with Azure Data Factory and Databricks (PySpark/SQL). o Working knowledge of Medallion Architecture and Delta Lake (schema evolution, ACID). · Programming & Automation o Strong Python (pandas/PySpark) and SQL. o Practical Git workflow; experience integrating pipelines into CI/CD (Azure DevOps/GitHub Actions/Jenkins). o Familiarity with packaging reusable code (e.g., Python wheels) and configuration-driven jobs. · Data Modeling & Warehousing o Solid grasp of dimensional modeling/star schemas; experience with Synapse, Snowflake, or SQL Server. · Data Quality & Monitoring o Implemented validation checks and alerts; exposure to drift detection and pipeline observability. · Cloud Platforms (Azure preferred) o ADLS Gen2, Key Vault, Databricks, ADF basics (linked services, datasets, triggers), environment promotion. · Data Governance & Security o Experience with metadata/lineage (Purview/Unity Catalog), RBAC, secrets management, and secure data sharing. o Understanding of PII/PHI handling and encryption at rest/in transit. · Collaboration

Candidate Roles and Responsibilities

 

 

Technical Skills required:

Must have Skills:

Skills Beginner Intermediate Expert

Azure Data Factory

and Databricks

(PySpark/SQL) Yes

Python

(pandas/PySpark) Yes

SQL Yes

CI/CD (Azure

DevOps/GitHub

Actions/Jenkins). Yes

exposure to drift

detection and

pipeline observability Yes

Cloud platform [Azure

preferred: ADLS

Gen2, Key Vault,

Databricks, ADF

basics] Yes

Data Governance &

Security Yes Good to have skills:

Skills Beginner Intermediate Expert

Databricks Asset

Bundles (DAB) Yes

o Clear communication, documentation discipline, Agile ways of working, and code reviews. Good-to-Have o Databricks Asset Bundles (DAB) for environment promotion/infra-as-code style deployments. o Streaming/real-time: Kafka/Event Hubs; CDC tools (e.g., Debezium, ADF/Synapse CDC). o MLOps touchpoints: MLflow tracking/registry, feature tables, basic model-inference pipelines. o Power BI exposure for publishing curated tables and building operational KPIs. o DataOps practices: automated testing, data contracts, lineage-aware deployments, cost optimization on Azure. o Certifications: Microsoft Certified — Azure Data Engineer Associate (DP-203) or equivalent. Qualifications · 4–6 years of professional experience in data engineering (or equivalent project depth). · Bachelor’s/Master’s in CS/IT/Engineering or related field (or equivalent practical experience)

Location:

IN-MH-Pune, India-Baner (eInfochips)

Time Type:

Full time

Job Category:

Engineering Services

About Company

Arrow Electronics is a Fortune 500 technology company that specializes in providing electronic components and enterprise IT solutions. Headquartered in Centennial, Colorado, it supports over 220,000 customers across 80+ countries. Arrow helps businesses design, build, and manage innovative technology products through its global distribution, engineering, and supply chain services.

Services you might be interested in

File Your ITR Now

Don’t wait for the deadline to stress you out!

Smart, fast, and reliable ITR filing for 2024-25. Submit your details today.