Lead AI/ML Engineer
VAYUZ Technologies
5 - 10 years
Thane
Posted: 27/04/2026
Getting a referral is 5x more effective than applying directly
Job Description
Job Summary
We are looking for an experienced ML Engineer / MLOps Engineer with strong expertise in building and deploying Large Language Model (LLM) applications in production environments. The ideal candidate will have hands-on experience in developing scalable AI systems, implementing RAG architectures, and ensuring performance, security, and reliability of LLM-based solutions.
Key Responsibilities
- Production Application Development
- Lead the end-to-end lifecycle of LLM applications, transforming prototypes into scalable, reliable, and production-ready systems.
- LLM API Integration & Orchestration
- Design and implement integrations with LLM APIs (e.g., OpenAI, Anthropic, internal models) with a focus on performance, cost optimization, and reliability.
- Prompt Engineering & Optimization
- Develop and refine advanced prompt strategies to ensure high-quality, accurate, and context-aware outputs.
- Context Management & RAG Implementation
- Build and optimize Retrieval-Augmented Generation (RAG) systems using vector databases and memory frameworks to improve response relevance.
- Output Validation & Quality Assurance
- Create validation pipelines to verify model outputs, reduce hallucinations, and ensure compliance with defined quality standards.
- AI Security & Risk Mitigation
- Implement safeguards against vulnerabilities such as prompt injection, indirect injection, and SQL injection across LLM applications.
- Production Deployment & Monitoring
- Deploy solutions using MLOps best practices across cloud platforms (AWS, GCP, Azure) and monitor performance, latency, token usage, and drift.
Required Skills & Qualifications
- Experience 5+ years of experience as an ML Engineer / MLOps Engineer
- Proven experience in deploying LLM applications into production environments
Technical Skills
- Strong proficiency in Python
- Experience with ML frameworks such as PyTorch or TensorFlow
- Hands-on experience with orchestration tools like Kubeflow or Airflow
- Experience with cloud platforms (AWS, GCP, or Azure)
- Knowledge of containerization (Docker, Kubernetes)
- Experience with vector databases (Pinecone, Weaviate, Chroma)
- Familiarity with RAG architectures and MLOps tools
LLM Expertise
- Strong understanding of LLM capabilities, limitations, and prompt engineering
- Awareness of security risks and mitigation strategies in generative AI
Core Competencies
- Strong problem-solving and analytical skills
- Ability to troubleshoot production-level issues (latency, performance, stability)
- Excellent communication and collaboration skills
Services you might be interested in
Improve Your Resume Today
Boost your chances with professional resume services!
Get expert-reviewed, ATS-optimized resumes tailored for your experience level. Start your journey now.
