Observability Engineer
Astria Digital
2 - 5 years
Gurugram
Posted: 22/02/2026
Job Description
Key Responsibilities
- Manage and support Linux-based infrastructure and containerized environments (Docker, Kubernetes).
- Administer and optimize large-scale Elasticsearch clusters, including configuration, scaling, performance tuning, and troubleshooting.
- Provide end-to-end system administration support across enterprise environments.
- Perform deep-dive troubleshooting across infrastructure, networking, and observability stack components.
- Support ITSM processes including incident, change, and problem management.
- Manage hardware and software lifecycle activities.
- Ensure platform stability, high availability, and performance optimization.
- Collaborate with Platform Engineering and SRE teams to enhance observability maturity.
- Assist in deployment, upgrades, and governance of observability tools.
- Contribute to automation initiatives and operational efficiency improvements.
Required Qualifications
- Strong expertise in Linux system administration.
- Hands-on experience with:
  - Docker
  - Kubernetes (production environments)
  - Elasticsearch (architecture, configuration, tuning, troubleshooting)
- Strong understanding of networking concepts (TCP/IP, DNS, load balancing, firewalls, routing).
- Experience working within ITSM frameworks and enterprise environments.
- Excellent troubleshooting and root cause analysis skills.
- Experience with Rancher, OpenSearch, OpenTelemetry.
- Knowledge of observability concepts such as Distributed Tracing, Metrics, Monitoring, and Logging.
- Experience managing large-scale Elasticsearch deployments.
- Hands-on experience with the following tools:
  - Jaeger
  - Kibana
  - Grafana
  - Prometheus
  - Splunk
  - Dynatrace
  - Kafka
