Experience - 8+ years

Work Mode - Hybrid (2 days WFO)

Work Location - Chennai, Mumbai and Gurugram

Key Responsibilities

Manage, maintain, and troubleshoot Linux-based systems and environments.
Design, develop, and maintain Infrastructure as Code using Terraform, including writing Terraform modules from scratch.
Deploy, configure, and manage Azure cloud infrastructure to support scalable and reliable applications.
Administer and maintain Kubernetes clusters, particularly Azure Kubernetes Service (AKS).
Perform Kubernetes cluster lifecycle management, including upgrades, scaling, monitoring, and troubleshooting.
Implement and maintain CI/CD pipelines, preferably using GitHub Actions.
Ensure system reliability, availability, and performance by following SRE and DevOps best practices.
Collaborate with development teams to improve deployment automation and infrastructure efficiency.
Monitor infrastructure and applications, and proactively resolve issues.

Mandatory Skills

Strong experience with Linux OS administration.
Hands-on experience with Terraform, including module creation and infrastructure automation.
Solid experience working with Microsoft Azure Cloud.
Strong expertise in Kubernetes cluster management, particularly Azure Kubernetes Service (AKS).
Experience with CI/CD tools, preferably GitHub Actions.

Role - Site Reliability Engineer