Job Summary
1. Good experience with Dynatrace (dashboards, alerts, events, and MTE integration)
2. Experience with Splunk ITSI
3. Good knowledge of SRE concepts
4. Able to work as an independent developer, with good communication skills
5. Terraform and AWS knowledge is a secondary requirement
6. Experience with other technologies such as Java/.NET is an added advantage (not mandatory)
7. Ready to take on production support and minor enhancements
Responsibilities
1. Oversee the design, implementation, and maintenance of infrastructure systems to ensure optimal performance and reliability.
2. Provide expertise in Site Reliability Engineering (SRE) to enhance system stability and efficiency.
3. Utilize Dynatrace AppMon for application performance monitoring and troubleshooting.
4. Leverage Splunk for log management, data analysis, and system monitoring.
5. Collaborate with cross-functional teams to identify and resolve infrastructure issues.
6. Develop and implement automation scripts to streamline operational processes.
7. Monitor system performance and proactively address potential issues.
8. Ensure compliance with security policies and best practices.
9. Conduct regular system audits and generate performance reports.
10. Participate in capacity planning and infrastructure scaling activities.
11. Provide technical support and guidance to team members.
12. Stay updated with the latest industry trends and technologies.
13. Contribute to the continuous improvement of infrastructure processes and practices.
Qualifications
1. Possess a strong background in Site Reliability Engineering (SRE).
2. Demonstrate proficiency in using Dynatrace AppMon for performance monitoring.
3. Exhibit expertise in Splunk for data analysis and system monitoring.
4. Have a solid understanding of infrastructure design and maintenance.
5. Show experience in developing and implementing automation scripts.
6. Display excellent problem-solving and troubleshooting skills.