Senior Machine Learning Engineer
WhiteLotus Talent Partners
5 - 10 years
Mumbai
Posted: 12/02/2026
Job Description
Job Title- PyTorch & Python ML Engineer
Location- Mumbai/Bangalore/Hyderabad/Gurgaon/Indore
Mode- Onsite
Job Type: Full-Time
Core Responsibilities
The role designs and owns end-to-end ML pipelines implemented in Python, covering audio data ingestion, preprocessing, feature extraction, training invocation, evaluation, and artifact generation.
The role is responsible for implementing and maintaining large-scale speech models using PyTorch, including transformer-based architectures for speech-to-text and text-to-speech, ensuring correctness, performance, and extensibility.
The role owns audio data ETL and dataset management workflows, including data validation, labeling interfaces, dataset versioning, and traceability between datasets and trained models.
The role ensures that model implementations are training- and inference-compatible, avoiding mismatches in tensor shapes, precision, preprocessing, or runtime assumptions.
The role works closely with training orchestration teams to ensure that model code is structured for distributed training, checkpointing, and resumability.
The role collaborates with inference and GPU teams to export trained models into production-ready formats and integrate them into high-performance inference services.
The role establishes coding standards, testing practices, and performance guidelines for Python-based ML code to ensure maintainability over long platform lifecycles.
Operational Ownership
The PyTorch & Python ML Engineering Lead owns ML implementation failures. If a model underperforms or behaves incorrectly due to bugs, mismatched preprocessing, or incorrect integration, this role is accountable.
The role owns the model lifecycle integrity, ensuring that trained artifacts can always be traced back to data, code versions, and training configurations.
The role is accountable for ensuring that ML pipelines are repeatable and debuggable, especially when issues arise in production.
The role participates in production incident analysis when issues stem from model implementation, data handling, or ML pipeline Behaviour.
Key Interfaces
This role works closely with the Distributed Systems Architect and Technical Governance & Platform Solutions Lead to ensure ML workflows align with architectural and design constraints.
The role interfaces daily with the PyTorch Lightning Training Lead to ensure that model code is structured correctly for scalable training workflows.
The role collaborates with the Speech Modelling & Quality Lead to translate quality improvement goals into concrete model changes and data requirements.
The role works with the GPU Inference Optimization Lead to ensure model artifacts are compatible with inference runtimes, precision requirements, and latency budgets.
The role coordinates with GitOps, CI/CD & Reliability teams to ensure ML pipelines integrate cleanly with build, deployment, and observability systems.
Explicit Non-Responsibilities
This role does not define speech quality targets or language-level accuracy goals; those are owned by the Speech Modelling & Quality Lead.
This role does not manage GPU infrastructure, Kubernetes scheduling, or inference service scaling.
This role does not own product-level feature definition or API contracts.
Role Expectation
The PyTorch & Python ML Engineering Lead is expected to ensure that machine learning development is treated as production engineering, not experimentation.
Success in this role is measured by:
Stable, maintainable ML codebases
Reproducible training and evaluation outcomes
Smooth transition from training to inference
Minimal production issues caused by ML pipeline defects
Ability to support rapid model iteration without system instability
Services you might be interested in
Improve Your Resume Today
Boost your chances with professional resume services!
Get expert-reviewed, ATS-optimized resumes tailored for your experience level. Start your journey now.
