🔔 FCM Loaded

AI/ML Engineer (DLMs, Embeddings, Fine-Tuning)

B4B SOLUTIONS ™

2 - 5 years

Hyderabad

Posted: 17/02/2026

Getting a referral is 5x more effective than applying directly

Job Description

Company Overview for Client
On behalf of a stealth-stage DeepTech AI company. This role is being recruited for a client that is transforming a major, multibillion dollar industry via proprietary AI.

Company Summary
We are representing a highly ambitious stealth DeepTech AI company operating across multiple international markets. Their core strength is a proprietary Deep Learning Model (DLM), fine-tuned on massive, domain-specific data to deliver performance superior to general-purpose foundation models. The company focuses on solving critical operational inefficiencies by providing AI-first automation, secure multi-platform tooling, and multilingual support. They are scaling rapidly toward significant ARR targets with strong unit economics and are positioned to capture a large market opportunity.

AI/ML Engineer (DLMs, Embeddings, Fine-Tuning) - Immediate Joins preferred
Job Title: AI/ML Engineer
Employment Type: Full-Time
Location: Hyderabad, India
Experience: 4+ Years in similar role.

Role Summary
You will be the core technical driver behind the clients competitive moat: the proprietary Deep Learning Model (DLM). This is a hands-on, highly technical role focused on achieving and maintaining domain-specific accuracy and cost-efficiency that generic models cannot replicate. You will own the end-to-end lifecycle of models from data ingestion and fine-tuning to deployment and optimization ensuring the platform delivers industry-leading accuracy (targeting 95% on complex domain benchmarks) while operating at a highly competitive cost advantage.

Key Responsibilities
  • Proprietary Model Fine-Tuning: Lead fine-tuning and customization of Large Language Models (LLMs) or similar Deep Learning Models using techniques such as LoRA/PEFT for domain-specific performance.
  • Design and manage high-volume data ingestion and cleaning pipelines; implement automated QA checks and coordinate expert corpus review.
  • Develop, manage, and optimize embeddings generation and vector search workflows; integrate with vector databases (e.g., Qdrant or similar) to enable accurate Retrieval-Augmented Generation (RAG).
  • Cost & Performance Moat: Optimize model inference for maximum throughput and minimal cost-per-query, specifically targeting operations that are up to 1000x cheaper than large commercial general-purpose models using techniques like distillation and quantization.
  • Implement and refine core AI algorithms for specialized tasks such as predictive insights and automated content extraction.
  • Collaborate on deploying and monitoring AI microservices in production with a focus on scalability, reliability, and observability.

Required Technical Skills
  • 4+ years of experience in AI/ML with deep specialization in LLMs, NLP, and fine-tuning techniques
  • Persona: Must be a highly autonomous, senior individual contributor with a commitment to engineering and optimizing proprietary, deep-tech IP
  • Expert-level Python and strong proficiency with PyTorch or TensorFlow
  • Practical experience with vector databases, embeddings pipelines, and RAG architectures
  • Demonstrated ability to optimize models for production deployment including GPU resource management, quantization, and distillation
  • Strong knowledge of data cleaning methodologies and advanced ML algorithms

What We Offer

  • We offer a competitive salary aligned with the Hyderabad startup market.
  • Continuous learning support including conference allowance and learning resource
  • Collaborative, lean team culture where decisions move fast and contributions are visible.
  • Opportunity for rapid career growth as the company scales and secures funding.
How to Apply
  • Please apply via LinkedIn Easy Apply or email with the following:
  • Updated CV highlighting relevant LLM, embeddings, and fine-tuning work
  • Short cover note (23 paragraphs) describing a recent project where you improved model accuracy or cost-efficiency; include the approach, tools, and measurable outcome.
  • Links to project artifacts (GitHub, Colab notebooks, papers, demo videos) if available.

Candidates who pass initial screening will be asked to answer 23 short technical questions and provide a concise case write-up or code sample demonstrating relevant experience.

Join Us and Make an Impact

By joining this team, you will contribute directly to a platform driving significant efficiency gains and reducing friction across large industry workflows. The client seeks professionals ready to commit, innovate, and scale as they pursue market leadership and industry transformation.

#DeepTech #AIMLEngineer #LLM #Quantization #RAG #HyderabadJobs #Startup #LegalTech

Services you might be interested in

Improve Your Resume Today

Boost your chances with professional resume services!

Get expert-reviewed, ATS-optimized resumes tailored for your experience level. Start your journey now.