Job Summary
We are seeking a Sr. Developer with 5 to 7 years of experience to join our dynamic team. The ideal candidate will have expertise in PySpark Cloud Dataproc and Cloud Dataflow. This hybrid role offers the flexibility of working both remotely and on-site with no travel required. The successful candidate will contribute to our innovative projects driving impactful solutions that align with our companys goals.
Responsibilities
Develop and maintain scalable data processing pipelines using PySpark to ensure efficient data handling.Implement and optimize data workflows on Cloud Dataproc to enhance performance and reliability.Utilize Cloud Dataflow to design and execute data transformation and integration tasks.Collaborate with cross-functional teams to gather and analyze requirements for data processing solutions.Provide technical guidance and support to junior developers fostering a collaborative and learning-oriented environment.Conduct code reviews to ensure adherence to best practices and maintain high code quality standards.Monitor and troubleshoot data processing jobs to identify and resolve issues promptly.Implement data security and compliance measures to protect sensitive information.Participate in the design and architecture discussions to contribute to the overall data strategy.Develop and maintain documentation for data processing workflows and procedures.Stay updated with the latest industry trends and technologies to continuously improve data processing capabilities.Contribute to the development of data analytics solutions to support business decision-making.Ensure seamless integration of data processing solutions with existing systems and platforms.
Qualifications
Possess strong expertise in PySpark for developing efficient data processing pipelines.Demonstrate experience with Cloud Dataproc for managing and optimizing data workflows.Have hands-on experience with Cloud Dataflow for data transformation and integration tasks.Exhibit excellent problem-solving skills and the ability to troubleshoot complex data processing issues.Show proficiency in collaborating with cross-functional teams to gather and analyze requirements.Display strong communication skills to provide technical guidance and support to junior developers.Maintain a proactive approach to staying updated with industry trends and technologies.Demonstrate a commitment to maintaining high code quality standards through code reviews.Possess knowledge of data security and compliance measures to protect sensitive information.Show experience in contributing to the design and architecture of data processing solutions.Exhibit the ability to develop and maintain comprehensive documentation for workflows and procedures.Demonstrate a strong understanding of integrating data processing solutions with existing systems.Have a proven track record of contributing to data analytics solutions for business decision-making.
Certifications Required
Certified PySpark Developer Google Cloud Professional Data Engineer