Site Reliability Engineer
Datum Technologies Group
2 - 5 years
Chennai
Posted: 15/05/2026
Getting a referral is 5x more effective than applying directly
Job Description
Work Experience: 7+ years
Work Mode: Hybrid(2 days WFO)
Work Location: Chennai, Mumbai and Gurgaon
Key Responsibilities
- Manage and troubleshoot Linux-based systems and environments
- Develop and maintain Infrastructure as Code using Terraform, including writing Terraform modules from scratch
- Design and manage scalable infrastructure on Microsoft Azure
- Administer and maintain Kubernetes clusters, particularly Azure Kubernetes Service (AKS)
- Handle Kubernetes cluster lifecycle management (scaling, upgrades, troubleshooting, and maintenance)
- Build and maintain CI/CD pipelines, preferably using GitHub Actions
- Implement DevOps and SRE best practices to improve system reliability and automation
- Collaborate with development teams to streamline deployment and infrastructure processes
Mandatory Skills
Strong experience in Linux OS
Hands-on experience with Terraform (module development)
Experience working with Azure Cloud
Expertise in Kubernetes cluster management (AKS)
Experience with CI/CD tools (preferably GitHub Actions)
Good to Have
- Experience with monitoring tools such as ELK Stack, Prometheus, or Grafana
- Exposure to DevOps monitoring and automation practices
Preferred Certifications
- Terraform Certification
- Microsoft Azure Certification (AZ-900 or higher)
- Kubernetes Certifications CKA / CKAD / CKS
Services you might be interested in
Improve Your Resume Today
Boost your chances with professional resume services!
Get expert-reviewed, ATS-optimized resumes tailored for your experience level. Start your journey now.
