Python Developer for Data Engineering
MSCI
5 - 8 years
Mumbai
Posted: 07/03/2026
Job Description
Your Team Responsibilities
The Data Engineering group within MSCI is responsible for delivering data products to MSCI's product lines globally. We're building a next-generation metadata-driven data platform that uses AI to automate and scale data onboarding. As part of a talented software development team in Mumbai, you'll have the opportunity to design and build state-of-the-art automation solutions that transform how MSCI ingests, governs, and delivers data to the investment community. This is a greenfield opportunity to architect systems that will reduce manual effort by 40-60% and accelerate vendor onboarding by 50-70%.
Your Key Responsibilities
- Design and develop AI-powered automation capabilities for data onboarding, including vendor file classification, metadata auto-suggestion, data profiling engines, and automated quality control frameworks
- Build LLM-integrated code generation systems for ingestion pipelines, PySpark transformations, and Airflow DAG orchestration
- Implement metadata management platforms that serve as the control plane for data lifecycle automation
- Develop RESTful APIs and integration layers connecting AI services, data platforms (Snowflake, Databricks), and orchestration frameworks
- Create human-in-the-loop workflows for validation, exception handling, and continuous model improvement
- Collaborate with cross-functional teams across data engineering, governance, and product to deliver end-to-end automation solutions
- Build scalable, fault-tolerant systems designed for metadata processing at scale
Your skills and experience that will help you excel
Required:
- 5-8 years of software development experience with strong Python programming expertise
- Knowledge of data manipulation libraries (Pandas, Polars) and analysis workflows
- Proficiency in SQL and data querying across modern data platforms
- Understanding of columnar storage formats and time-series analytics (ClickHouse, Parquet, Iceberg)
- Experience with AI-assisted development tools (GitHub Copilot, Cursor, or similar)
- Strong understanding of RESTful API design and implementation
- Experience with Git version control and collaborative development workflows
- Demonstrated ability to take ownership of complex technical solutions end-to-end
- Strong analytical and problem-solving skills with attention to data quality and reliability
Preferred:
- Hands-on experience with LLMs (OpenAI, Anthropic, or open-source models) and prompt engineering
- Familiarity with Agentic AI frameworks and orchestration patterns
- Experience with cloud platforms (Azure or GCP) and modern data infrastructure
- Exposure to financial concepts, reference data, or market data processing
- Experience building metadata-driven or self-service data platforms
About Company
MSCI Inc. is a leading global provider of critical decision-support tools and services for the investment community. The company is best known for its market indexes, such as the MSCI World and MSCI Emerging Markets Indexes, which are widely used as benchmarks by asset managers and institutional investors worldwide. In addition to indexes, MSCI offers portfolio risk and performance analytics, real estate data, and environmental, social, and governance (ESG) research to help clients make informed investment decisions. With a strong presence across major financial markets, MSCI plays a pivotal role in shaping investment strategies and facilitating transparency in global capital markets.
Services you might be interested in
Improve Your Resume Today
Boost your chances with professional resume services!
Get expert-reviewed, ATS-optimized resumes tailored for your experience level. Start your journey now.
