Data Engineer II
Amazon
2 - 5 years
Bengaluru
Posted: 01/08/2025
Job Description
The mission of Amazon's Regulatory Intelligence Safety and Risk (RISC) team is to protect customers from products that are unsafe, illegal, illegally marketed, controversial or otherwise in violation of Amazon’s policies while enabling our Selling Partners to offer their broadest selection of safe and compliant products.
We achieve these objectives worldwide by: (1) taking a science-first approach to offer trustworthy listings to our customers, (2) inventing intuitive and precise tools to simplify our selling partners’ compliance journey and (3) innovating to reduce our cost to serve.
The RISC Data Engineering team is seeking an experienced Data Engineer with solid engineering skills and a machine learning (MLOps) background to join our team.
In this role, you will be responsible for designing, building, and maintaining large-scale, robust data pipelines and infrastructure to empower our machine learning, data science and analytics initiatives.
You will collaborate closely with Applied Scientists, Machine Learning Scientists, and business stakeholders to understand their requirements and support AI/ML solutions.
Join our expert team to build scalable data solutions, improving Amazon business efficiency and simplifying our selling partners' compliance journey.
Key job responsibilities
1. Design, build, and maintain scalable, fault-tolerant, and efficient data pipelines and infrastructure for machine learning operations (MLOps) leveraging AWS technologies such as Lambda, Glue, EMR/Spark, Step Functions, Airflow, DynamoDB and AWS Batch.
2. Automate infrastructure deployment, maintenance processes, and incorporate CI/CD principles to streamline the MLOps ecosystem, using AWS services and scripting languages like Python or Scala.
3. Develop optimized data models, ETL/ELT processes, data transformations, and data warehouse to ensure high-quality, well-structured data for ML and analytics, using S3, Redshift, Glue, Athena and Lake Formation.
4. Collaborate closely with Applied Scientists, Machine Learning Scientists, and analytics teams to understand data requirements, and provide scalable data solutions.
5. Adopt genAI solutions to transform and enhance data engineering and MLOps processes.
6. Continuously monitor, optimize, and enhance data pipelines, processes, and infrastructure to support ML and analytics.
7. Implement and enforce rigorous data governance, security, and compliance standards for our data, including data validation, cleansing, and lineage tracking.
8. Mentor junior engineers, promoting best practices and knowledge sharing in data engineering and MLOps.
9. Stay updated with emerging technologies, tools, and trends, incorporating them into the existing ecosystem for continuous improvement.
About the team
Who Are We
We are a team of scientists and engineers building AI/ML and data solutions to improve Amazon business efficiency and simplify our selling partners' compliance journey.
Work/Life Balance
We value work-life harmony.
Achieving success at work should never require sacrifices at home, which is why we strive for flexibility as part of our working culture.
When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
Mentorship and Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer.
That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
Basic Qualifications
- Bachelor's degree in computer science, engineering, mathematics, statistics or a related field
- 3+ years of data engineering experience
- Experience with ML
- Experience with data modeling, warehousing and building ETL pipelines
- Knowledge of distributed systems
- Knowledge of professional software engineering and best practices for the full software development life cycle, including coding standards, software architectures, code reviews, source control management, continuous deployments, testing, and operational excellence
Preferred Qualifications
- Experience with AWS technologies such as Redshift, S3, AWS Glue, EMR, Kinesis, Firehose, Lambda, Step Functions, Airflow, DynamoDB, AWS Batch and SageMaker, as well as IAM roles and permissions
- Experience with non-relational databases / data stores (object storage, document or key-value stores, graph databases, column-family databases)
- Experience with advanced ML system design, implementation and maintenance
- Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS
- Strong problem-solving and engineering skills, with the ability to translate business requirements into technical solutions
Our inclusive culture empowers Amazonians to deliver the best results for our customers.
If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information.
If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
About Company
Amazon is a multinational technology and e-commerce company founded by Jeff Bezos in 1994. Initially focused on selling books online, it quickly expanded into a broad range of products and services, including electronics, cloud computing (via Amazon Web Services), streaming, and artificial intelligence. Amazon has revolutionized online shopping with fast delivery, personalized recommendations, and a subscription service called Amazon Prime. It is one of the world's largest and most valuable companies, with a significant impact on retail, technology, and logistics.