🔔 FCM Loaded

Principal AI Architect - Conversational AI & RAG Systems

alphastream.ai

2 - 5 years

Bengaluru

Posted: 12/02/2026

Getting a referral is 5x more effective than applying directly

Job Description

Principal AI Architect - Conversational AI & RAG Systems


Designation -Member of Technical Staff

Location: Bangalore

Experience: 8+ years

Mode of work: Work from Office


Role Overview

We're looking for a Principal AI Architect to lead the design and architecture of our conversational AI platform that powers private credit analysis for institutional investors. You'll architect end-to-end RAG pipelines, build MCP servers over complex financial datasets, and design agentic workflows that enable multi-step reasoning across deals, funds, and market data.

This is a hands-on architectural role where you'll both design systems and guide implementation. You'll lead our Alpha AI squad, partner with R&D leadership on research direction, and set standards for AI/ML engineering across the company.

Impact: Your architecture will directly determine how fast and accurately our AI can analyse private credit portfolios ($100M+ AUM clients), which translates to deal-closing speed for asset managers.

What You'll Build

1. Conversational AI Architecture

  • Design MCP (Model Context Protocol) server architecture for structured financial data access
  • Architect RAG pipeline: retrieval re-ranking context funneling generation
  • Build tool orchestration layer: function calling, parameter validation, response parsing
  • Design agentic workflows with task decomposition and multi-turn reasoning
  • Example: Architect a system where "Compare portfolio risk across Q1 and Q2" triggers: data extraction metric computation trend analysis response synthesis

2. Retrieval System Design

  • Design hybrid search strategy combining keyword (BM25) + semantic (vector) search
  • Architect vector database strategy: embedding models, indexing (HNSW/IVF), sharding
  • Implement re-ranking with cross-encoders and Maximum Marginal Relevance (MMR)
  • Build citation system linking every generated statement to source document + page number
  • Optimize query expansion, passage relevance scoring, and metadata filtering
  • Performance target: Retrieval recall@10 > 90%, answer citation accuracy > 95%

3. MCP Server Development

  • Build MCP servers over structured datasets: deal terms, fund performance, financials, market comps
  • Design tool definitions with strict schemas: function signatures, parameter types, validation rules
  • Implement guardrails: confidence thresholds, schema validation, human-in-the-loop triggers
  • Example: MCP server for "Get deal covenants" must return structured JSON with covenant type, threshold, measurement period, breach conditions

4. System Observability & Quality

  • Build observability framework: log queries, retrieval candidates, tool calls, responses, latencies
  • Design RAG quality metrics: answer accuracy, citation precision, retrieval recall, hallucination rate
  • Implement A/B testing framework for prompt strategies and retrieval configurations
  • Create debugging dashboards for queryretrievalgeneration pipeline tracing
  • Target: <2s end-to-end latency (95th percentile), query success rate > 98%

5. Technical Leadership

  • Lead Alpha AI squad (4-6 engineers): guide architectural decisions, review designs, unblock technical challenges
  • Conduct design reviews for all AI/ML features across Analyst Platform and Insights Engine
  • Define AI/ML platform standards: model serving, versioning, monitoring, rollback procedures
  • Mentor full-stack engineers on LLM integration patterns, prompt engineering, and AI best practices
  • Partner with Head of R&D on research direction: what models to fine-tune, what benchmarks to chase

Must-Have Qualifications

Experience & Expertise:

  • 8+ years in AI/ML engineering with 3+ years designing production RAG or conversational AI systems
  • Proven track record architecting and deploying LLM-powered applications serving >10K users or $10M+ revenue
  • Deep expertise in retrieval systems: vector databases (Pinecone/Weaviate/Qdrant), embedding models, hybrid search, re-ranking
  • Strong understanding of LLM architectures, fine-tuning, and prompt engineering (few-shot, CoT, ReAct patterns)
  • Experience building agentic workflows with tool use, function calling, and multi-step reasoning

Technical Skills:

  • Vector databases: Production experience with Vespa, Pinecone etc
  • LLM frameworks: LangChain, LlamaIndex, or custom orchestration for complex workflows
  • Embedding models: SentenceTransformers, OpenAI embeddings, domain-specific fine-tuning
  • MCP or similar protocols: Experience building structured data access layers for LLMs
  • Python (expert level): FastAPI, asyncio, Pydantic for API design and validation
  • Evaluation frameworks: RAGAS, LangSmith, custom benchmarking for RAG quality

Leadership:

  • Experience leading technical teams (4+ engineers) through architecture design and implementation
  • Track record of setting technical standards and conducting design reviews across multiple products
  • Ability to translate business requirements into technical architecture and success metrics

Added Advantage

Domain & Scale:

  • Experience in FinTech, credit analysis, or financial data platforms (understanding of deals, covenants, financials)
  • Built systems processing >1M documents or 100GB+ knowledge bases
  • Familiarity with private credit, structured finance, or asset management workflows

Advanced Techniques:

  • Fine-tuned Small Language Models for domain-specific tasks (e.g., financial entity extraction, classification)
  • Implemented query decomposition and planning
  • Built multi-modal RAG systems (text + tables + charts)
  • Experience with prompt optimization at scale (DSPy, automatic prompt tuning)

Infrastructure:

  • Designed model serving infrastructure: inference optimization, batching, caching, A/B deployment
  • Built real-time streaming pipelines for document ingestion and embedding generation
  • Experience with Kubernetes, Docker, and ML Ops tooling (MLflow, Weights & Biases)

Research:

  • Published papers or blog posts on RAG, retrieval, or agentic AI
  • Contributed to open-source projects in LLM tooling or vector search


Who We Are-

Alphastream.ai envisions a dynamic future for the financial world, where innovation is propelled by state-of-the-art AI technology and enriched by a profound understanding of credit and fixed-income research. Our mission is to empower asset managers, research firms, hedge funds, banks, and investors with smarter, faster, and curated data. We provide accurate, timely information, analytics, and tools across simple to complex financial and non-financial data, enhancing decision-making. With a focus on bonds, loans,financials and sustainability, we offer near real-time data via APIs and PaaS (Platform as a Service) solutions that act as the bridge between our offerings and seamless workflow integration.

To learn more about us: https://alphastream.ai/

What we offer

"At Alphastream.ai we offer a dynamic and inclusive workplace where your skills are valued and your career can flourish. Enjoy competitive compensation, a comprehensive benefits package, and opportunities for professional growth. Immerse yourself in an innovative work environment, maintain a healthy work-life balance, and contribute to a diverse and inclusive culture. Join us to work with cutting-edge technology, and be part of a team that recognizes and rewards your achievements, all while fostering a fun and engaging workplace culture."

Disclaimer-

Alphastream.ai is an equal opportunities employer. We work to provide a supportive and inclusive environment where all individuals can maximize their full potential. Our skilled and creative workforce is comprised of individuals drawn from a broad cross section of all communities in which we operate and who reflect a variety of backgrounds, talents, perspectives, and experiences. Our strong commitment to a culture of inclusion is evident through our constant focus on recruiting, developing, and advancing individuals based on their skills and talents.

Services you might be interested in

Improve Your Resume Today

Boost your chances with professional resume services!

Get expert-reviewed, ATS-optimized resumes tailored for your experience level. Start your journey now.