Senior Data Engineer
NomadX
5 - 10 years
Delhi
Posted: 31/01/2026
Job Description
About NomadX:
NomadX is transforming the food safety industry with three proprietary technologies designed to ensure that no contaminated product leaves a processing facility.
1) NomadX Decontamination from Farm to Processing is more effective, sustainable, and safe for humans, animals, food, and the environment.
2) NomadX Sample Collection & Concentration Method boasts improvement in sample collection efficiency, which delivers same-day pathogen detection results, enabling real-time containment.
3) NomadX Detection & Characterization is highlighted by enhanced spectroscopy that detects whole, viable bacteria with little to no preparation and enables detection with same-day results.
NomadX aims to revolutionize global food safety standards. Join us in leading this change.
About the Role:
NomadX is seeking a highly motivated Senior Data Engineer to join our team. As a Senior Data Engineer, you will design, build, and maintain software that powers automated instruments and real-time data analysis, directly impacting public health.
Qualifications
- B.Tech or master's degree in Data Science, Computer Science, AI/ML, Computational Chemistry, Bioinformatics, or a related quantitative field.
- 5+ years (with a B.Tech) or 3+ years (with a master's degree) of hands-on experience in machine learning, deep learning, or data science, with demonstrable work in spectral analysis (NIR, FTIR, Raman, UV-Vis, MS, NMR, or hyperspectral data).
- Strong proficiency with post-2022 AI/DL ecosystems, including:
  - PyTorch 2.x, TensorFlow 2.12+, JAX
  - Hugging Face Transformers (2023+), Diffusers
  - Vision Transformers, Spectral Transformers
  - LoRA, QLoRA, Adapter Training
- Experience with signal and spectral processing workflows, including denoising, baseline correction, SNR improvement, peak detection, and normalization using SciPy 1.10+, CuPy, RAPIDS, or PyTorch DataPipes.
- Strong foundations in statistics, linear algebra, optimization, and numerical methods.
- Proficiency in Python and modern data libraries (Polars, Pandas 2.x, NumPy 2.x).
- Experience building end-to-end ML pipelines using MLflow 2.x, DVC 3.x, or equivalent MLOps tools.
- Hands-on experience with GPU acceleration, TensorRT, ONNX Runtime, PyTorch Compile, or quantization for real-time inference.
- Ability to work with large, complex, and noisy datasets in laboratory or industrial environments.
- Knowledge of self-supervised learning (MAE, SimCLR, BYOL, MoCo v3, DINOv2).
- Experience integrating multimodal AI models that combine spectra, metadata, and imaging.
- Familiarity with cloud platforms (AWS SageMaker, Azure ML, GCP Vertex AI) and GPU cluster management (Ray, Lightning Fabric, Kubernetes).
- Experience with explainable AI for spectral data, such as SHAP 2.x, Captum, Grad-CAM++, or Spectral Attribution Maps.
- Experience with Bayesian modeling, uncertainty quantification, or probabilistic programming (Pyro, NumPyro).
- Understanding of chemometric techniques (PLS, PCA/ICA, multivariate curve resolution, classical least squares).
- Familiarity with instrument control, automation, or real-time spectral data acquisition systems.
- Excellent communication skills with the ability to work cross-functionally with chemists, biologists, engineers, and product teams.
Benefits:
- Competitive compensation
- Medical insurance, term insurance
- Flexible paid time off, paid holidays
