Observability Lead
Persistent Systems
5 - 10 years
Pune
Posted: 21/03/2026
Job Description
About Position:
Observability in DevOps is the practice of using logs, metrics, and traces to deeply understand the internal state of complex, distributed systems from their external outputs. It enables teams to proactively debug, troubleshoot, and resolve unknown, real-time issues, moving beyond simple, reactive monitoring.
- Role: Observability Lead
- Location: All Persistent Locations
- Experience: 10 to 15 years
- Job Type: Full Time Employment
What You'll Do:
- Building, maintaining, and optimizing dashboards using Grafana
- Working with Prometheus, Loki, Tempo, or similar monitoring and logging stacks
- Enhancing system visibility, alerting, SLO/SLI tracking, and overall performance monitoring
- Designing scalable, production-grade monitoring and observability frameworks
- Implementing and maintaining CI/CD pipelines, infrastructure automation, and deployment workflows
- Partnering with DevOps/SRE teams to strengthen observability maturity and improve platform reliability
- Contributing to infrastructure design using tools like Terraform, Kubernetes, Helm, etc.
- Driving continuous improvement in system uptime, resiliency, and operational excellence.
Expertise You'll Bring:
- Grafana Stack
- Build and manage endtoend observability dashboards using Grafana, Loki, Tempo, and Mimir for metrics, logs, and traces.
- Kubernetes
- Implement and optimize observability across Kubernetes clusters including monitoring, logging, tracing, and autoscaling insights.
- Jenkins CI
- Design and maintain CI pipelines in Jenkins with integrated observability checks and automated deployment workflows.
- Prometheus
- Set up and manage Prometheus for metrics scraping, alert rules, service monitoring, and custom exporter integrations.
- Terraform
- Use Terraform to automate provisioning of observability components with scalable, versioncontrolled infrastructure-as-code.
- Shell / Python
- Develop Shell and Python scripts for automation, alert generation, log parsing, and custom observability tooling.
- GitOps
- Implement GitOps workflows using ArgoCD/FluxCD for consistent, automated deployment of observability configurations and dashboards.
Benefits:
- Competitive salary and benefits package
- Culture focused on talent development with quarterly growth opportunities and company-sponsored higher education and certifications
- Opportunity to work with cutting-edge technologies
- Employee engagement initiatives such as project parties, flexible work hours, and Long Service awards
- Annual health check-ups
- Insurance coverage: group term life, personal accident, and Mediclaim hospitalization for self, spouse, two children, and parents
Values-Driven, People-Centric & Inclusive Work Environment:
Persistent is dedicated to fostering diversity and inclusion in the workplace. We invite applications from all qualified individuals, including those with disabilities, and regardless of gender or gender preference. We welcome diverse candidates from all backgrounds.
- We support hybrid work and flexible hours to fit diverse lifestyles.
- Our office is accessibility-friendly, with ergonomic setups and assistive technologies to support employees with physical disabilities.
- If you are a person with disabilities and have specific requirements, please inform us during the application process or at any time during your employment
Let's unleash your full potential at Persistent - persistent.com/careers
"Persistent is an Equal Opportunity Employer and prohibits discrimination and harassment of any kind."
Services you might be interested in
Improve Your Resume Today
Boost your chances with professional resume services!
Get expert-reviewed, ATS-optimized resumes tailored for your experience level. Start your journey now.
