ITOPS / Observability AI Architect

Themesoft Inc.

2 - 5 years

Bengaluru

Posted: 17/02/2026

Job Description

Job Title: ITOPS / Observability AI Architect

Job Summary

We are seeking a highly experienced IT Operations (ITOPS) and Observability AI Architect to lead the design, development, and implementation of advanced observability and AIOps solutions for enterprise clients. The ideal candidate should have 10+ years of experience in technical development and architecture roles, with deep expertise in observability platforms, AIOps tools, cloud-native architectures (Azure/AWS), containerization, orchestration, and automation. This role requires a strong understanding of modern observability technologies, AI-driven operations, and the ability to architect scalable, intelligent systems that enhance operational efficiency and resilience.

Key Responsibilities

Develop end-to-end observability and AIOps architectures for large-scale enterprise environments.

Define standards and best practices for monitoring, alerting, and automated remediation.

Drive the deployment and integration of observability platforms and AIOps tools across hybrid and multi-cloud environments.

Ensure seamless integration with ITSM, DevOps, and CI/CD pipelines.

Evaluate emerging technologies in observability and AIOps to recommend strategic adoption.

Design AI/ML-driven predictive analytics for proactive incident management and root cause analysis.

Work closely with clients, operations, and business teams to align architecture with organizational goals.

Mentor technical teams on observability and AIOps best practices.

Optimize system performance through advanced telemetry, distributed tracing, and anomaly detection.

Implement automated workflows for incident prevention and resolution.

Experience & Qualifications

MCA or B.E. degree in Computer Science or a related field.

10+ years in IT architecture or technical leadership roles.

Proven expertise in observability tools (e.g., Dynatrace, Datadog, New Relic, Prometheus, Grafana) and AIOps platforms (e.g., Moogsoft, BigPanda, ServiceNow AIOps).

Strong experience with Azure/AWS cloud architectures, containerization (Docker), and orchestration (Kubernetes).

Hands-on experience with automation frameworks and infrastructure-as-code (Terraform, Ansible).

Hands-on experience in IT operations, preferably with IT infrastructure and application services.

Skills:

Deep understanding of monitoring, logging, distributed tracing, and telemetry.

Knowledge of AI/ML concepts applied to IT operations.

Excellent problem-solving, communication, and leadership skills.

Good understanding of and exposure to ITIL frameworks.

Preferred:

Certifications in cloud platforms (AWS/Azure), Kubernetes, or observability tools.

Experience in designing self-healing systems and predictive analytics for IT operations.
