About Us:

Virallens is a dynamic, techdriven organisation helping businesses scale, innovate and lead through powerful AI solutions. We build intelligent systems powered by stateoftheart generative AI technologies across diverse industries. Our collaborative, fastpaced environment is perfect for innovators eager to have an immediate impact.

Role overview

Were seeking an AI Engineer who will design and deploy intelligent systems that leverage large language models (LLMs), retrievalaugmented generation (RAG) and vector databases to solve complex enterprise problems. You will build endtoend pipelines for ingesting, encoding and indexing data, integrate knowledge graphs and hybrid retrieval strategies, and optimize models for latency, accuracy and cost.

Key responsibilities

Design, prototype & deploy retrievalaugmented generation systems: Architect scalable RAG pipelines that combine vector search, hybrid retrieval, reranking and contextual compression techniques. Build and integrate vector search systems (e.g.Milvus, pgvector, FAISS, Weaviate) for highrecall retrieval across structured and unstructured data.
Develop hybrid retrieval and knowledgedriven pipelines: Design hybrid retrieval systems that blend semantic, symbolic and graphbased methods. Create custom chunking and encoding strategies to store operational knowledge in vector databases and knowledge graphs.
Build knowledge graphs & integrate them into retrieval workflows: Architect knowledge graphs (Neo4j, RDF, custom schemas) and integrate them into retrieval workflows to support reasoning and decisionmaking.
Optimise data pipelines and embeddings: Build and optimise data pipelines that convert incoming documents into highquality embeddings for AI retrieval. Tune chunk sizes, indexing frequencies and embedding strategies to enhance recall, factual accuracy and efficiency.
Implement hybrid search & metadata filtering: Combine semantic and keyword search to improve precision and efficiency. Experiment with metadata filtering techniques to surface the most relevant context for AI reasoning agents.
Evaluate & monitor system performance: Evaluate endtoend retrieval performance using classical IR metrics (precision, recall) and LLMspecific evaluations (factuality, coherence, task success). Monitor retrieval logs and adjust embedding configurations to maintain relevance and mitigate hallucinations.
Compare & finetune LLMs: Compare performance of different LLMs (e.g.GPT4, Claude, Llama) across embedding structures and refine tuning strategies. Implement quantisation, distillation and optimisation techniques to meet latency, throughput and cost targets.
Collaborate & enable teams: Work crossfunctionally with product managers, data engineers and domain experts to translate product goals into scalable AI solutions. Conduct workshops and enablement sessions to enhance AI literacy across internal teams.
Ensure quality & compliance: Participate in rigorous code reviews and implement testing frameworks to ensure reliability, security and compliance. Continuously monitor model accuracy and safety, and uphold data governance and ethical guidelines.

Required qualifications

Experience: 5+ years of software engineering experience with deep expertise in Python, experience building and deploying RAG or information-retrieval systems, and strong proficiency in TensorFlow and PyTorch.
Retrieval expertise: Demonstrated ability to design hybrid retrieval pipelines, encode knowledge using LLMs and vector stores, and build and optimise RAG systems.
Vector databases & search algorithms: Proficiency with vector databases and search libraries such as pgvector, FAISS, Milvus, Pinecone or Weaviate, and strong understanding of vector search algorithms, indexing strategies and hybrid search techniques.
Embedding & LLM frameworks: Handson experience with embeddings and transformerbased models (e.g.OpenAI, Cohere, Sentence Transformers) and frameworks such as HuggingFace Transformers, LangChain and LlamaIndex.
Distributed systems & deployment: Practical knowledge of distributed systems, ETL pipelines, Docker and Kubernetes, along with cloud platforms (Azure, AWS, GCP) for deploying AI applications.
Evaluation & security: Familiarity with evaluation of retrieval systems, observability tools and model performance monitoring. Understanding of data governance, security and compliance considerations.

Preferred qualifications

Knowledge graphs & multimodal search: Experience designing and deploying knowledge graphs, semantic graphs or multimodal search systems.
Finetuning & RLHF: Familiarity with LLM finetuning, reinforcement learning from human feedback (RLHF) and safety alignment.
Multimodal AI: Exposure to multimodal models (image, video, audio) and diffusion models.
Opensource contributions: Contributions to opensource generative AI, retrieval or vector database projects, or published research/blogs.
Frontend prototyping: Experience with React/Next.js for rapid prototyping of AIdriven applications.
Advanced degrees: Masters or PhD in Computer Science, AI, Machine Learning or related fields (preferred but not mandatory). Extensive relevant experience or significant opensource contributions may substitute formal education.

Why join Virallens?

Innovationdriven environment: Work on cuttingedge AI solutions and shape the future of enterprise AI.
Immediate impact: See your work deployed rapidly across realworld applications.
Collaborative culture: Your ideas directly influence our AI initiatives and product roadmap.
Growth opportunities: Engage with diverse projects, develop new skills and advance your career in a highgrowth domain.

If youre passionate about advancing AI, love solving complex information retrieval problems and are eager to drive innovation across industries, wed love to hear from you.

AI Engineer

Virallens

Job Description

Services you might be interested in

Improve Your Resume Today