Python Developer for Data Engineering

MSCI

5 - 8 years

Mumbai

Posted: 07/03/2026

Job Description

Your Team Responsibilities

The Data Engineering group within MSCI is responsible for delivering data products to MSCI's product lines globally. We're building a next-generation metadata-driven data platform that uses AI to automate and scale data onboarding. As part of a talented software development team in Mumbai, you'll have the opportunity to design and build state-of-the-art automation solutions that transform how MSCI ingests, governs, and delivers data to the investment community. This is a greenfield opportunity to architect systems that will reduce manual effort by 40-60% and accelerate vendor onboarding by 50-70%.

Your Key Responsibilities

  • Design and develop AI-powered automation capabilities for data onboarding, including vendor file classification, metadata auto-suggestion, data profiling engines, and automated quality control frameworks
  • Build LLM-integrated code generation systems for ingestion pipelines, PySpark transformations, and Airflow DAG orchestration
  • Implement metadata management platforms that serve as the control plane for data lifecycle automation
  • Develop RESTful APIs and integration layers connecting AI services, data platforms (Snowflake, Databricks), and orchestration frameworks
  • Create human-in-the-loop workflows for validation, exception handling, and continuous model improvement
  • Collaborate with cross-functional teams across data engineering, governance, and product to deliver end-to-end automation solutions
  • Build scalable, fault-tolerant systems for high-volume metadata processing

Your skills and experience that will help you excel

Required:

  • 5-8 years of software development experience with strong Python programming expertise
  • Knowledge of data manipulation libraries (Pandas, Polars) and analysis workflows
  • Proficiency in SQL and data querying across modern data platforms
  • Understanding of columnar storage and table formats (Parquet, Iceberg) and time-series analytics (ClickHouse)
  • Experience with AI-assisted development tools (GitHub Copilot, Cursor, or similar)
  • Strong understanding of RESTful API design and implementation
  • Experience with Git version control and collaborative development workflows
  • Demonstrated ability to take ownership of complex technical solutions end-to-end
  • Strong analytical and problem-solving skills with attention to data quality and reliability

Preferred:

  • Hands-on experience with LLMs (OpenAI, Anthropic, or open-source models) and prompt engineering
  • Familiarity with Agentic AI frameworks and orchestration patterns
  • Experience with cloud platforms (Azure or GCP) and modern data infrastructure
  • Exposure to financial concepts, reference data, or market data processing
  • Experience building metadata-driven or self-service data platforms

About Company

MSCI Inc. is a leading global provider of critical decision-support tools and services for the investment community. The company is best known for its market indexes, such as the MSCI World and MSCI Emerging Markets Indexes, which are widely used as benchmarks by asset managers and institutional investors worldwide. In addition to indexes, MSCI offers portfolio risk and performance analytics, real estate data, and environmental, social, and governance (ESG) research to help clients make informed investment decisions. With a strong presence across major financial markets, MSCI plays a pivotal role in shaping investment strategies and facilitating transparency in global capital markets.
