Data Architect (Databricks, BigQuery)
TalentXO
2 - 5 years
Alipur
Posted: 29/05/2026
Getting a referral is 5x more effective than applying directly
Job Description
Hiring for a Client - Auxo AI
Role & Responsibilities
- Lead enterprise-scale implementation of data warehouse data platforms on Databricks and Snowflake environments.
- Design and implement Medallion (Bronze/Silver/Gold) architecture and scalable enterprise data models.
- Establish data modeling standards (dimensional, data vault, lakehouse patterns) and ensure best practices across projects
- Establish enterprise data governance frameworks including cataloging, lineage, stewardship, and compliance using Atlan.
- Define and implement CI/CD pipelines for infrastructure and data platform deployments
- Design data architectures that support AI/ML and Generative AI workloads including vector storage, feature layers, and secure access patterns.
- Build scalable ingestion frameworks supporting batch, streaming, and CDC pipelines.
- Architect secure, high-performance data integration layers for analytics, BI, and AI consumption.
- Develop target-state architecture blueprints and enforce data standards, governance, and best practices across teams.
- Collaborate with engineering, analytics, and data science teams to ensure platform alignment and scalability.
- Engage with clients as a trusted advisor, driving data strategy, roadmap definition, and identifying opportunities for expansion.
Ideal Candidate
- Strong Databricks / AWS Data Architect profile
- Mandatory (Experience 1) Must have minimum 8+ years of experience in Data Architecture / Data Engineering, with exposure in enterprise-scale data platform modernization initiatives
- Mandatory (Experience 2) Must have minimum 3+ years of deep hands-on experience in Databricks-based lakehouse architecture on AWS, including large-scale data platform implementations
- Mandatory (Experience 3) Strong expertise in Databricks ecosystem including Delta Lake, Databricks SQL, Unity Catalog, Delta Live Tables, and MLflow with focus on performance optimization and security
- Mandatory (Experience 4) Strong experience with AWS data services including S3, Glue, EMR, Lambda, Redshift, Athena, Lake Formation, and DMS, with strong understanding of cloud-native architecture patterns
- Mandatory (Experience 5) Proven experience designing and implementing Medallion (Bronze/Silver/Gold) architecture, scalable data models (Dimensional/Data Vault), and enterprise lakehouse platforms supporting batch and real-time processing
- Mandatory (Experience 6) Must have hands-on experience building scalable ingestion frameworks including batch, streaming, and CDC pipelines using tools like Kafka, Kinesis, Spark, or similar technologies
- Mandatory (Skill 1) Proven experience implementing CI/CD pipelines for data platforms, including infrastructure as code, automated deployments, and environment management
- Mandatory (Skill 2) Hands-on experience enabling data platforms for AI/ML and Generative AI use cases, including feature stores, vector storage, and secure data access patterns
- Mandatory (Skill 3) Experience with orchestration tools such as Apache Airflow or MWAA and designing integration layers for analytics, BI, and AI consumption
- Preferred (Company) Product Companies.
- Preferred (Certification) AWS / Databricks / Snowflake certifications; experience with Snowflake alongside Databricks; exposure to MDM, data quality frameworks, and enterprise metadata tools
Services you might be interested in
We Search & Apply Jobs for You!
Our team scans through 1000s of opportunities and applies to roles best suited to your profile
Save 100+ hours and focus on what matters - cracking interviews and landing offers.
