About the Role

Our client is seeking a highly capable Generative AI Engineer (24 years of experience) with hands-on experience in building and deploying LLM-powered applications and scalable AI systems. This role is ideal for professionals who have moved beyond foundational AI knowledge and are actively designing, optimizing, and productionizing Generative AI solutions.

You will play a critical role in developing advanced AI applications, improving RAG pipelines, optimizing model performance, and contributing to architectural decisions. The position requires strong practical exposure to real-world AI systems and the ability to independently own features or modules within AI-driven products.

Key Responsibilities

Design and develop advanced Generative AI and LLM-powered applications (chatbots, copilots, AI agents, workflow automation systems, etc.).
Build and optimize end-to-end RAG pipelines including embedding strategies, chunking techniques, vector search, hybrid retrieval, and ranking layers.
Implement and manage vector databases such as Pinecone, Weaviate, FAISS, Qdrant, or Chroma.
Integrate LLMs into production-grade backend systems using Python (FastAPI/Flask) or Node.js.
Develop evaluation frameworks to measure prompt quality, reduce hallucinations, and improve response reliability.
Perform prompt optimization and context engineering for domain-specific use cases.
Support fine-tuning workflows and parameter-efficient training (LoRA/QLoRA).
Orchestrate AI workflows using frameworks like LangChain, LangGraph, LlamaIndex, or CrewAI.
Deploy AI services on AWS, GCP, or Azure with proper CI/CD and monitoring practices.
Optimize model inference costs, latency, and scalability.
Collaborate with cross-functional teams to translate business requirements into scalable AI solutions.
Mentor junior engineers and contribute to AI best practices.

Required Skills

24 years of professional experience in AI/ML, NLP, or backend engineering with strong exposure to Generative AI and LLM-based systems.
Strong proficiency in Python (Very Important) and API development using FastAPI/Flask.
Deep understanding of Large Language Models (LLMs), embeddings, transformers, and tokenization (Very Important).
Strong experience designing and implementing Retrieval-Augmented Generation (RAG) systems (Very Important).
Hands-on experience with vector databases (Very Important).
Solid foundation in Machine Learning and Natural Language Processing (Important).
Experience working with AI APIs (OpenAI, Anthropic, Gemini, HuggingFace, Cohere, etc.).
Experience with Docker (Important) and containerized deployments.
Working knowledge of MLOps practices including model versioning, monitoring, and CI/CD (Important).
Understanding of scalable backend systems, REST APIs, and distributed system fundamentals.
Exposure to prompt engineering and response optimization techniques.

Preferred Skills (Good to Have)

Experience with multimodal AI models (image, audio, or video models).
Hands-on experience with agent-based frameworks (LangGraph, CrewAI, AutoGen).
Knowledge of advanced retrieval techniques (hybrid search, reranking models, colBERT).
Familiarity with Kubernetes and container orchestration.
Experience building domain-specific AI assistants or internal AI tools.
Understanding of AI security, PII handling, and compliance best practices.
Exposure to caching layers and asynchronous job queues for scalable AI systems.
Experience with cloud computing environments (AWS, GCP, Azure).

Soft Skills

Strong analytical and problem-solving mindset.
Ability to independently own modules and deliver high-quality solutions.
Clear communication skills to explain technical decisions to stakeholders.
Ownership mindset with accountability for delivery.
Ability to work in a fast-paced and evolving AI ecosystem.
Continuous learner with curiosity for emerging AI technologies.
Collaborative team player with mentorship capability.

About YMinds.AI

YMinds.AI is a premier talent solutions company specializing in sourcing and delivering elite developers with cutting-edge AI expertise. We support global enterprises and fast-growing startups by connecting them with engineers who excel in building intelligent, scalable, and future-ready systems. Our clients are at the forefront of AI innovation, and we enable their success by providing exceptional talent that accelerates product development and drives technological advancement.

Keywords

Generative AI, LLM, RAG Systems, Vector Databases, Python, FastAPI, LangChain, LangGraph, LlamaIndex, CrewAI, Prompt Engineering, Fine-Tuning, LoRA, QLoRA, Machine Learning, NLP, Docker, MLOps, Cloud Computing, AI Agents, Scalable AI Systems.

#GenerativeAI #LLMEngineer #AIEngineer #RAGSystems #VectorSearch #LangChain #LangGraph #PythonDeveloper #OpenAI #HuggingFace #MachineLearning #NLP #MLOps #FineTuning #LoRA #QLoRA #CloudComputing #Docker #AIJobs #TechCareers

Generative AI Engineer (2–4 Years of Experience)

YMinds.AI

Job Description

Services you might be interested in

Improve Your Resume Today