Openshift L4 Engineer
Tata Consultancy Services
2 - 5 years
Bengaluru
Posted: 28/04/2026
Getting a referral is 5x more effective than applying directly
Job Description
TCS has been a great pioneer in feeding the fire of young techies like you. We are a global leader in the technology arena and theres nothing that can stop us from growing together.
What we are looking for
Role: Openshift L4 Engineer
Experience Range: 10 to 15 Years
Location: Chennai, Kolkata, Hyderabad, Bangalore, Pune, Delhi
Must Have:
- Expert-Level OpenShift: Deep, authoritative knowledge of OCP installation (IPI/UPI), upgrades, cluster administration, node management, and disaster recovery.
- Kubernetes Mastery: A fundamental and deep understanding of Kubernetes architecture and components (etcd, kube-apiserver, scheduler, etc.) and Operators (OLM).
- Infrastructure as Code (IaC): Strong proficiency withAnsible and Terraform for automating infrastructure provisioning and configuration management.
- Programming/Scripting: Advanced scripting and software development skills in Python or Go, as well as Bash.
- Observability: Hands-on experience building and managing monitoring and logging solutions (e.g., Prometheus, Grafana, Thanos, Alertmanager, ELK Stack, Splunk, Fluentd/Vector/OTEL).
- CI/CD & GitOps: Expertise with CI/CD tooling (e.g., Tekton ,Jenkins, GitLab CI, ArgoCD, GitHub Actions).
- Core Infrastructure: Strong proficiency in Linux/RHEL administration, networking (SDN, OVS, routing, firewalls, load balancer), and storage (Ceph, NFS, block storage, Object).
Good to Have:
- 8+ years of overall experience in roles such as Site Reliability Engineering, DevOps, or Linux Systems Engineering.
- 5+ years of hands-on, intensive experience administering, automating, and troubleshooting Red Hat OpenShift (OCP 4.x preferred) in large-scale production environments.
- Proven experience in a senior or lead engineering role, demonstrating ownership of complex projects and mentorship of others.
Essential:
- Define and Uphold Reliability Standards: Establish and manage Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets for the OpenShift platform and its core services.
- Automate Everything: Design, build, and maintain robust automation to handle the full lifecycle of OpenShift clusters, including provisioning, upgrades, patching, scaling, and disaster recovery.
- Reduce Toil: Proactively identify and eliminate manual, repetitive operational work by developing and maintaining automation scripts (Python, Go, Bash) and Ansible playbooks.
- Incident Response and Root Cause Analysis: Lead high-severity incident response and conduct deep, blameless post-mortems to identify and implement permanent solutions to prevent recurrence.
- Proactive Health Management: Develop and implement automated health checks and self-healing capabilities to ensure cluster and application resilience.
- Subject Matter Expertise: Serve as the top-tier technical authority for OpenShift Container Platform architecture, networking (OVN-Kubernetes, SDN), load balancing, cross cluster management, storage (OpenShift Data Foundation/Ceph), and security.
- Observability: Architect and manage a comprehensive observability stack (e.g., Prometheus, Grafana, ELK/Fluentd) to provide deep insights into platform and application performance.
- CI/CD and GitOps: Engineer and optimize CI/CD pipelines for both platform components and tenant applications, championing GitOps principles for declarative configuration management.
- Capacity and Performance: Conduct advanced performance tuning, load testing, and capacity planning to ensure the platform can meet future demand.
Minimum Qualification:
- 15 years of full-time education
- Minimum percentile of 50% in 10th, 12th, UG & PG (if applicable)
Services you might be interested in
Improve Your Resume Today
Boost your chances with professional resume services!
Get expert-reviewed, ATS-optimized resumes tailored for your experience level. Start your journey now.
