Site Reliability Engineer
Datum Technologies Group
2 - 5 years
Chennai
Posted: 13/04/2026
Job Description
Experience - 8+ years
Work Mode - Hybrid (2 days work from office)
Work Location - Chennai, Mumbai and Gurugram
Key Responsibilities
- Manage, maintain, and troubleshoot Linux-based systems and environments.
- Design, develop, and maintain Infrastructure as Code using Terraform, including writing Terraform modules from scratch.
- Deploy, configure, and manage Azure cloud infrastructure to support scalable and reliable applications.
- Administer and maintain Kubernetes clusters, particularly Azure Kubernetes Service (AKS).
- Perform Kubernetes cluster lifecycle management, including upgrades, scaling, monitoring, and troubleshooting.
- Implement and maintain CI/CD pipelines, preferably using GitHub Actions.
- Ensure system reliability, availability, and performance by following SRE and DevOps best practices.
- Collaborate with development teams to improve deployment automation and infrastructure efficiency.
- Monitor infrastructure and applications, and proactively resolve issues.
Mandatory Skills
- Strong experience with Linux OS administration.
- Hands-on experience with Terraform, including module creation and infrastructure automation.
- Solid experience working with Microsoft Azure Cloud.
- Strong expertise in Kubernetes cluster management, particularly Azure Kubernetes Service (AKS).
- Experience with CI/CD tools, preferably GitHub Actions.
