DevOps Engineer – ML & RAG
Talentgigs
2 - 5 years
Hyderabad
Posted: 07/06/2026
Job Description
Role DevOps Engineer
Role Overview: YOE: 5+
Work Mode: 4+1 Hybrid
Number of Openings: 1
Job Description: The DevOps Engineer will play a critical role in operationalizing artificial intelligence across Bell Techlogix client environments. This role focuses on building and supporting cloud infrastructure, CI/CD pipelines, and automation frameworks that power AI and machine learning workloads. The ideal candidate has experience supporting AI platforms such as Azure AI, Azure Machine Learning, Azure OpenAI, and ServiceNow or conversational AI platforms, and understands the operational requirements of production AI systems, including reliability, scalability, and security.
Key Responsibilities
Design, build, and operate cloud infrastructure and platform services that support AI and machine learning workloads in production, SLA-driven managed services environments
Implement CI/CD and MLOps pipelines to enable automated training, testing, deployment, and rollback of AI and ML models
Develop and maintain Infrastructure as Code to provision AI-ready environments consistently across dev/test/prod
Support AI platform operations including monitoring model health, pipeline execution, compute utilization, and data dependencies
Partner with Machine Learning Engineers and Data Engineers to standardize deployment patterns for AI services and LLM-based solutions
Enable secure and scalable AI integrations using APIs, messaging, and event-driven architectures
Implement observability solutions for AI platforms, including logging, metrics, alerting, and drift detection integrations
Troubleshoot AI platform incidents, perform root cause analysis, and implement remediation to improve reliability and automation coverage
Apply security best practices for AI environments including secrets management, identity and access controls, network isolation, and policy enforcement
Support AI-driven automation use cases across platforms such as Microsoft Copilot, ServiceNow, and conversational AI tools
Collaborate with service desk, security, and architecture teams to continuously improve AI service delivery and operational maturity
Required Qualifications
Bachelors degree in Computer Science, Engineering, or equivalent practical experience
5+ years of experience in DevOps, cloud engineering, or platform operations, with exposure to AI or data workloads
Hands-on experience with LLM-based applications, including Retrieval-Augmented Generation (RAG)
Hands-on experience with Microsoft Azure, including compute, networking, storage, and monitoring services
Experience building CI/CD pipelines using Azure DevOps, GitHub Actions, or similar tools
Working knowledge of Infrastructure as Code (Terraform and/or Bicep/ARM)
Scripting experience using PowerShell and/or Python
Experience supporting production platforms with incident management, change control, and root cause analysis
Understanding of cloud security fundamentals and enterprise governance requirements Preferred Qualifications
Experience with Azure Machine Learning, Azure AI Services, Azure OpenAI, or MLOps frameworks
Exposure to containerization and orchestration technologies (Docker, Kubernetes, AKS)
Experience supporting data pipelines or feature stores used by machine learning systems
Familiarity with ServiceNow, AI-driven ITSM workflows, or automation platforms
Experience with observability tools
Knowledge of Responsible AI, data governance, and compliance considerations for AI systems
Relevant certifications (Microsoft Azure Administrator, Azure DevOps Engineer, Azure AI Engineer)
Services you might be interested in
We Search & Apply Jobs for You!
Our team scans through 1000s of opportunities and applies to roles best suited to your profile
Save 100+ hours and focus on what matters - cracking interviews and landing offers.
