AI Engineer
RapidBrains
2 - 5 years
Bengaluru
Posted: 20/04/2026
Job Description
Job Title: AI Engineer
Experience: 5+ Years
Location: Onsite - HSR Layout, Bengaluru
Employment Type: Permanent Hire
Notice Period: Immediate Joiners Preferred
Overview
We are looking for an AI Engineer Researcher to drive core intelligence, focusing on building efficient, production-grade agentic AI systems. This role sits at the intersection of applied research and real-world deployment, with a strong emphasis on Small Language Models (SLMs). You will contribute to advancing model efficiency, domain-specific training, and scalable deployment across industries such as robotics, industrial automation, and edge AI. You will play a key role in translating cutting-edge AI research into practical, deployable systems that operate directly on devices and embedded environments, not just in the cloud.
Key Responsibilities
- Design and develop Small Language Models (SLMs) tailored for domain-specific applications
- Build and optimize fine-tuning pipelines using parameter-efficient fine-tuning (PEFT) methods such as LoRA, along with quantization
- Implement model compression strategies such as distillation, pruning, and quantization-aware training
- Develop and manage prompt engineering workflows and agent orchestration systems
- Optimize models for low-latency, high-throughput inference in production environments
- Deploy AI models on edge devices and embedded systems
- Improve model performance across latency, accuracy, and cost efficiency
- Align models with hardware constraints (GPU, CPU, edge accelerators)
- Design evaluation frameworks to measure model reliability and effectiveness
- Collaborate with cross-functional teams to integrate AI into real-world systems
Skills & Requirements
- 5+ years of experience in AI/ML, with a focus on applied research or production systems
- Strong experience in SLM/LLM development and optimization
- Hands-on expertise in fine-tuning techniques (PEFT methods such as LoRA, quantization)
- Solid understanding of model compression techniques (distillation, pruning, QAT)
- Experience in prompt engineering and agent-based systems
- Proficiency in optimizing inference performance (latency, throughput)
- Experience with edge AI deployment (on-device / embedded systems)
- Knowledge of hardware-aware optimization across GPUs, CPUs, and accelerators
- Strong programming skills in Python and ML frameworks (e.g., PyTorch, TensorFlow)
- Ability to work in a fast-paced, high-ownership environment
