Job Title: AI Engineer

Experience: 5+ Years

Location: Onsite - HSR Layout, Banglore

Employment Type: Permanent Hire

Notice Period: Immediate Joiners Preferred

Overview

We are looking for an AI Engineer Researcher to drive core intelligence, focusing on building efficient, production-grade agentic AI systems.This role sits at the intersection of applied research and real-world deployment, with a strong emphasis on Small Language Models (SLMs). You will contribute to advancing model efficiency, domain-specific training, and scalable deployment across industries such as robotics, industrial automation, and edge AI.You will play a key role in translating cutting-edge AI research into practical, deployable systems that operate directly on devices and embedded environmentsnot just in the cloud.

Key Responsibilities

Design and develop Small Language Models (SLMs) tailored for domain-specific applications
Build and optimize fine-tuning pipelines using techniques like LoRA, PEFT, and quantization
Implement model compression strategies such as distillation, pruning, and quantization-aware training
Develop and manage prompt engineering workflows and agent orchestration systems
Optimize models for low-latency, high-throughput inference in production environments
Deploy AI models on edge devices and embedded systems
Improve model performance across latency, accuracy, and cost efficiency
Align models with hardware constraints (GPU, CPU, edge accelerators)
Design evaluation frameworks to measure model reliability and effectiveness
Collaborate with cross-functional teams to integrate AI into real-world systems

Skills & Requirements

5+ years of experience in AI/ML, with a focus on applied research or production systems
Strong experience in SLM/LLM development and optimization
Hands-on expertise in fine-tuning techniques (LoRA, PEFT, quantization)
Solid understanding of model compression techniques (distillation, pruning, QAT)
Experience in prompt engineering and agent-based systems
Proficiency in optimizing inference performance (latency, throughput)
Experience with edge AI deployment (on-device / embedded systems)
Knowledge of hardware-aware optimization across GPUs, CPUs, and accelerators
Strong programming skills in Python and ML frameworks (e.g., PyTorch, TensorFlow)
Ability to work in a fast-paced, high-ownership environment

AI Engineer

RapidBrains

Job Description

Services you might be interested in

Improve Your Resume Today