Job Summary
We are seeking an experienced R2 Architect with 10 to 13 years of experience in SRE, DevOps, and SRE concepts. The ideal candidate will work in a hybrid model, primarily during the day shift. This role does not require travel. The candidate will play a crucial role in ensuring the reliability and efficiency of our systems, contributing to the company's overall success and societal impact.
Responsibilities
- Lead the design and implementation of SRE practices to enhance system reliability and performance.
- Oversee the development and maintenance of automated solutions for system monitoring and incident response.
- Provide technical guidance and mentorship to the SRE team to ensure best practices are followed.
- Collaborate with cross-functional teams to identify and address system bottlenecks and performance issues.
- Implement and manage CI/CD pipelines to streamline software delivery processes.
- Develop and maintain comprehensive documentation for SRE processes and procedures.
- Conduct regular system audits and performance reviews to ensure optimal operation.
- Implement robust incident management protocols to minimize downtime and service disruptions.
- Monitor system health and performance metrics to proactively address potential issues.
- Drive continuous improvement initiatives to enhance system reliability and efficiency.
- Ensure compliance with industry standards and best practices in SRE and DevOps.
- Facilitate effective communication and collaboration between development and operations teams.
- Utilize data-driven insights to inform decision-making and optimize system performance.
Qualifications
- Possess extensive experience in SRE, DevOps, and SRE concepts.
- Demonstrate proficiency in implementing and managing CI/CD pipelines.
- Exhibit strong problem-solving skills and the ability to address complex system issues.
- Have a solid understanding of automated monitoring and incident response solutions.
- Show excellent communication and collaboration skills to work effectively with cross-functional teams.
- Maintain a proactive approach to system health and performance monitoring.
- Display a commitment to continuous improvement and to staying current with industry trends.
- Hold relevant certifications in SRE or DevOps practices.
- Bring a proven track record of enhancing system reliability and efficiency.
- Demonstrate the ability to mentor and guide team members in best practices.
- Exhibit strong organizational skills and attention to detail.
- Have experience in developing and maintaining comprehensive documentation.
- Show a commitment to ensuring compliance with industry standards and best practices.