Site Reliability Engineer

Datum Technologies Group

2 - 5 years

Chennai

Posted: 13/04/2026

Job Description

Experience - 8+ years

Work Mode - Hybrid (2 days WFO)

Work Location - Chennai, Mumbai and Gurugram


Key Responsibilities

  • Manage, maintain, and troubleshoot Linux-based systems and environments.
  • Design, develop, and maintain Infrastructure as Code using Terraform, including writing Terraform modules from scratch.
  • Deploy, configure, and manage Azure cloud infrastructure to support scalable and reliable applications.
  • Administer and maintain Kubernetes clusters, particularly Azure Kubernetes Service (AKS).
  • Perform Kubernetes cluster lifecycle management, including upgrades, scaling, monitoring, and troubleshooting.
  • Implement and maintain CI/CD pipelines, preferably using GitHub Actions.
  • Ensure system reliability, availability, and performance by following SRE and DevOps best practices.
  • Collaborate with development teams to improve deployment automation and infrastructure efficiency.
  • Monitor infrastructure and applications using monitoring tools and proactively resolve issues.

Mandatory Skills

  • Strong experience with Linux OS administration.
  • Hands-on experience with Terraform, including module creation and infrastructure automation.
  • Solid experience working with Microsoft Azure Cloud.
  • Strong expertise in Kubernetes cluster management, particularly Azure Kubernetes Service (AKS).
  • Experience with CI/CD tools, preferably GitHub Actions.
