Login Sign Up

AI Engineer

RapidBrains

2 - 5 years

Bengaluru

Posted: 20/04/2026

Getting a referral is 5x more effective than applying directly

Job Description

Job Title: AI Engineer

Experience: 5+ Years

Location: Onsite - HSR Layout, Banglore

Employment Type: Permanent Hire

Notice Period: Immediate Joiners Preferred


Overview

We are looking for an AI Engineer Researcher to drive core intelligence, focusing on building efficient, production-grade agentic AI systems.This role sits at the intersection of applied research and real-world deployment, with a strong emphasis on Small Language Models (SLMs). You will contribute to advancing model efficiency, domain-specific training, and scalable deployment across industries such as robotics, industrial automation, and edge AI.You will play a key role in translating cutting-edge AI research into practical, deployable systems that operate directly on devices and embedded environmentsnot just in the cloud.


Key Responsibilities

  • Design and develop Small Language Models (SLMs) tailored for domain-specific applications
  • Build and optimize fine-tuning pipelines using techniques like LoRA, PEFT, and quantization
  • Implement model compression strategies such as distillation, pruning, and quantization-aware training
  • Develop and manage prompt engineering workflows and agent orchestration systems
  • Optimize models for low-latency, high-throughput inference in production environments
  • Deploy AI models on edge devices and embedded systems
  • Improve model performance across latency, accuracy, and cost efficiency
  • Align models with hardware constraints (GPU, CPU, edge accelerators)
  • Design evaluation frameworks to measure model reliability and effectiveness
  • Collaborate with cross-functional teams to integrate AI into real-world systems


Skills & Requirements

  • 5+ years of experience in AI/ML, with a focus on applied research or production systems
  • Strong experience in SLM/LLM development and optimization
  • Hands-on expertise in fine-tuning techniques (LoRA, PEFT, quantization)
  • Solid understanding of model compression techniques (distillation, pruning, QAT)
  • Experience in prompt engineering and agent-based systems
  • Proficiency in optimizing inference performance (latency, throughput)
  • Experience with edge AI deployment (on-device / embedded systems)
  • Knowledge of hardware-aware optimization across GPUs, CPUs, and accelerators
  • Strong programming skills in Python and ML frameworks (e.g., PyTorch, TensorFlow)
  • Ability to work in a fast-paced, high-ownership environment

Services you might be interested in

Improve Your Resume Today

Boost your chances with professional resume services!

Get expert-reviewed, ATS-optimized resumes tailored for your experience level. Start your journey now.