Data Scientist
Kotak Mahindra Bank
3 - 5 years
Bengaluru
Posted: 06/12/2024
Job Description
What we offer
Our mission is simple: building trust. Our customers' trust in us is not merely about the safety of their assets but also about how dependable our digital offerings are. That's why we at Kotak Group are dedicated to transforming banking by adopting a technology-first approach in everything we do, with the aim of enhancing customer experience through superior banking services. We invite the best technological minds in the country to join us in our mission to make banking seamless and swift. Here, we promise you meaningful work that positively impacts the lives of many.
About our team
DEX (Kotak's Data Exchange) is the central data org for Kotak Bank and manages the bank's entire data experience. The org comprises the Data Platform, Data Engineering and Data Governance charters and works closely with the Analytics org. DEX is primarily working on a greenfield project to revamp the entire data platform, moving from on-premise solutions to a scalable AWS cloud-based platform. The team is being built from the ground up, which gives technologists a great opportunity to build things from scratch and deliver a best-in-class data lakehouse solution. The primary skills this team needs are software development, preferably in Python, for platform building on AWS; data engineering with Spark (PySpark, Spark SQL, Scala) for ETL development; and advanced SQL and data modelling for analytics.
The org is expected to have 100+ members, primarily based out of Bangalore, organized into ~10 sub-teams that drive their charters independently.
As a member of this team, you get the opportunity to learn the fintech space, one of the most sought-after domains today; be an early member of Kotak's digital transformation journey; learn and leverage technology to build complex data platform solutions, including real-time, micro-batch, batch and analytics solutions, in a programmatic way; and look ahead to build systems that can be operated by machines using AI technologies.
The data platform org is divided into 3 key verticals:
Data Platform
This vertical is responsible for building the data platform, which includes optimized storage for the entire bank and a centralized data lake; a managed compute and orchestration framework, including serverless data solutions; a central data warehouse for extremely high-concurrency use cases; connectors for different sources; a customer feature repository; cost optimization solutions such as EMR optimizers; automations; and observability capabilities for Kotak's data platform. The team will also be the center of Data Engineering excellence, driving training and knowledge-sharing sessions with the large data consumer base within Kotak.
Data Engineering
This team will own data pipelines for thousands of datasets, source data from 100+ source systems and enable data consumption for 30+ data analytics products. The team will learn and build data models in a config-based, programmatic way, and think big to build one of the most leveraged data models for financial orgs. This team will also enable centralized reporting for Kotak Bank that cuts across multiple products and dimensions. Additionally, the data built by this team will be consumed by 20K+ branch consumers, RMs, Branch Managers and all analytics use cases.
Data Governance
This team will be the central data governance team for Kotak Bank, managing metadata platforms, data privacy, data security, data stewardship and the data quality platform.
If you have the right data skills and are ready to build data lake solutions from scratch for high-concurrency systems spanning multiple systems, then this is the team for you.
Your day-to-day role will include:
- Drive business decisions with technical input and lead the team.
- Design, implement, and support a data infrastructure from scratch.
- Manage AWS resources, including EC2, EMR, S3, Glue, Redshift, and MWAA.
- Extract, transform, and load data from various sources using SQL and AWS big data technologies.
- Explore and learn the latest AWS technologies to enhance capabilities and efficiency.
- Collaborate with data scientists and BI engineers to adopt best practices in reporting and analysis.
- Improve ongoing reporting and analysis processes, automating or simplifying self-service support for customers.
- Build data platforms, data pipelines, or data management and governance tools.
BASIC QUALIFICATIONS for Data Engineer / SDE in Data
- Bachelor's degree in Computer Science, Engineering, or a related field
- 3-5 years of experience in data engineering
- Strong understanding of AWS technologies, including S3, Redshift, Glue, and EMR
- Experience with data pipeline tools such as Airflow and Spark
- Experience with data modeling and data quality best practices
- Excellent problem-solving and analytical skills
- Strong communication and teamwork skills
- Experience in at least one modern scripting or programming language, such as Python, Java, or Scala
- Strong advanced SQL skills
BASIC QUALIFICATIONS for Data Engineering Manager / Software Development Manager
- 10+ years of engineering experience, most of which is in the data domain
- 5+ years of engineering team management experience
- 10+ years of experience planning, designing, developing and delivering consumer software
- Experience partnering with product or program management teams
- 5+ years of experience managing data engineers, business intelligence engineers and/or data scientists
- Experience designing or architecting (design patterns, reliability and scaling) of new and existing systems
- Experience managing multiple concurrent programs, projects and development teams in an Agile environment
- Strong understanding of Data Platform, Data Engineering and Data Governance
- Experience partnering with product and program management teams
- Experience designing and developing large-scale, high-traffic applications
PREFERRED QUALIFICATIONS
- AWS cloud technologies: Redshift, S3, Glue, EMR, Kinesis, Firehose, Lambda, IAM, Airflow
- Prior experience in the Indian banking segment and/or fintech is desired
- Experience with non-relational databases and data stores
- Building and operating highly available, distributed data processing systems for large datasets
- Professional software engineering and best practices for the full software development life cycle
- Designing, developing, and implementing different types of data warehousing layers
- Leading the design, implementation, and successful delivery of large-scale, critical, or complex data solutions
- Building scalable data infrastructure and understanding distributed systems concepts
- SQL, ETL, and data modelling
- Ensuring the accuracy and availability of data to customers
- Proficiency in at least one scripting or programming language for large-volume data processing
- Strong presentation and communication skills
About Company
Kotak Mahindra Bank is one of India's leading private sector banks, offering a wide range of financial services including personal banking, corporate banking, investment banking, insurance, and asset management. Established in 1985 and headquartered in Mumbai, it is known for its innovative banking solutions, customer-centric approach, and strong focus on digital transformation. The bank caters to diverse customer segments, from individuals to large corporations, emphasizing trust, transparency, and growth.
