Login Sign Up
🔔 FCM Loaded

Gen AI Engineer

L&T Technology Services

2 - 5 years

Pune

Posted: 20/03/2026

Getting a referral is 5x more effective than applying directly

Job Description

Experience- 3 to 8 Years

Location: Pune

Job Description:


We are looking for a hands on GenAI Engineer to design, build, and productionize AI solutions that leverage Retrieval Augmented Generation (RAG), Large Language Models (LLMs), LLMOps, and multi agent systems. Youll own the end to end lifecyclefrom data ingestion and orchestration to deployment, evaluation, guardrails, and monitoringwhile collaborating with product, platform, and domain teams to ship reliable, cost efficient AI features at scale.


What Youll Do (Key Responsibilities)



Solution Architecture & Delivery


Design RAG pipelines (chunking, indexing, embeddings, retrieval, re ranking, synthesis) and select optimal vector DBs & re rankers for each use case.

Build multi agent workflows (planner/executor, tool using agents, collaborative agents) with robust state management and failure recovery.

Implement LLMOps: automated evaluations, data & model lineage, observability, cost controls, rollback strategies.



Data & Retrieval Engineering


Build ingestion pipelines for unstructured/semi structured content (PDFs, Office docs, HTML, emails, logs) with robust parsing, PII redaction, deduplication, and metadata enrichment.

Optimize embeddings (model selection, dimensionality, multilingual handling) and retrieval quality (query transformation, hybrid search, learning to rank).



Prompt Engineering & Safety


Develop system prompts, tool calling schemas, and guardrails (content, privacy, compliance) with iterative prompt optimization and A/B testing.

Implement safety, policy, and governance: jailbreak resistance, hallucination mitigation, citation enforcement, rate limit handling.



Orchestration & Deployment


Productionize pipelines using workflow engines (e.g., LangGraph, Autogen, CrewAI, Airflow/Prefect) with containerization (Docker) and Kubernetes.

Deploy & scale inference (vLLM, Triton, Ray, Helm) across cloud/on prem; manage secrets, keys, and per tenant configs.



Evaluation, Monitoring & Cost Optimization


Define and track metrics: answer correctness, faithfulness, groundedness, latency, throughput, token cost, retrieval hit@k.

Integrate observability (Prometheus/Grafana), tracing (OpenTelemetry), and E2E analytics (W&B/MLflow) with automated regression tests.



Tech Stack (You dont need all; we value depth in relevant areas)


LLM & RAG: OpenAI/Azure OpenAI, Anthropic, Google, Cohere; open source models via Hugging Face (Llama, Mistral, Qwen).

RAG Components: FAISS, Milvus, Pinecone

Frameworks: LangChain, LlamaIndex, LangGraph, Autogen, CrewAI; tool calling & function schemas.

Orchestration & Pipelines: Airflow, Prefect, Dagster; Ray for distributed workloads.

Serving: vLLM, Triton Inference Server, FastAPI, gRPC; Helm/K8s, Istio/Linkerd.

Ops & Observability: MLflow, Weights & Biases, OpenTelemetry, Prometheus/Grafana; Feature/Model Registry.

Data & Parsing: Unstructured, Apache Tika, Textract, Tesseract; Pandas/Spark.

Caching & Messaging: Redis, Kafka.



Qualifications (Must Have)


Solid hands on experience building with LLMs and RAG, including retrieval tuning and prompt/system design.

Proven ability to productionize GenAI workloads (Kubernetes, CI/CD, containerization, secrets, autoscaling, rollout/rollback).

Experience with LLMOps: evaluation frameworks (e.g., RAGAS/DeepEval/HELM style metrics), tracing, monitoring, and cost management.

Strong prompt engineering expertise: tool calling, schema design, structured outputs, guardrails, and prompt A/B testing.

Exposure to multi agent systems (planner/executor, tool orchestration, memory/state) and their failure modes.

Proficiency in Python (typing, testing, packaging) and building APIs/services (FastAPI) with clean architecture patterns.

Understanding of data privacy, security, governance in AI systems (PII handling, policy enforcement, model risk).


Nice to Have


Experience with vector DB benchmarking and embedding model selection

Document intelligence (layout aware parsing, table extraction, OCR quality improvement).

Domain exposure to SDLC automation (code review assist, test generation, requirement traceability) or manufacturing/agri OEM knowledge.

Contributions to open source GenAI projects; publications or talks a plus.

Services you might be interested in

Improve Your Resume Today

Boost your chances with professional resume services!

Get expert-reviewed, ATS-optimized resumes tailored for your experience level. Start your journey now.