AI Data Engineer

Posted on May 27, 2026

  • Karnataka, India
  • Full Time

This role has been designed as "Onsite", with an expectation that you will primarily work from an HPE office.

Who We Are:

Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world. Our culture thrives on finding new and better ways to accelerate what’s next. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career, our culture will embrace you. Open up opportunities with HPE.

Job Description:


HPE Financial Services is where we help organizations create the investment they need for digital transformation, in an innovative and sustainable way. We partner with customers across their entire IT asset portfolio from edge to cloud to end-user. Unique to each client’s aspirations and size, our financial and asset management solutions are anchored by best-in-class tech upcycling services. Join us to redefine what’s next for you.

Role summary

We are looking for a technically sharp and detail-oriented Data Engineer to join the HPEFS (Hewlett Packard Enterprise Financial Services) Advanced Analytics & BI team in Bangalore. This role is the data backbone that powers our AI capabilities, working in close partnership with the AI Engineers to ensure that the data flowing into AI models, dashboards, and business workflows is clean, governed, and well-structured. You will be hands-on and own the backend data lifecycle: ingesting raw data from diverse sources, transforming it into reliable, analysis-ready datasets, enforcing data quality standards, and publishing governed data products via Microsoft Fabric and Databricks. You will also support reporting needs through Power BI and contribute to Collibra-based data governance initiatives. A working familiarity with Microsoft Copilot and AI-assisted data tooling is expected.

What you'll do:

Data Engineering & Transformation

  • Design, build, and maintain scalable ETL/ELT pipelines using Azure Data Factory, Databricks (PySpark / Delta Live Tables), and Microsoft Fabric Data Factory.

  • Transform raw, multi-source data into clean, conformed, and analytics-ready datasets following Medallion Architecture principles (Bronze → Silver → Gold).

  • Develop and optimize SQL and PySpark-based transformation logic for structured, semi-structured, and unstructured data.

  • Implement incremental load patterns, merge/upsert logic, and slowly changing dimension (SCD) strategies to support historical data tracking.

  • Collaborate with the AI Engineers to prepare high-quality feature datasets for ML and LLM use cases.

Data Quality & Governance

  • Define, implement, and monitor data quality rules including completeness, accuracy, consistency, timeliness, and uniqueness checks.

  • Administer and extend the Collibra data governance platform — including business glossary management, data lineage documentation, and stewardship workflows.

  • Build automated data quality validation frameworks using tools such as Great Expectations, dbt tests, or Unity Catalog data quality constraints in Databricks.

  • Triage and resolve data quality incidents, root-cause data anomalies, and communicate impact to stakeholders proactively.

  • Maintain metadata catalogues and ensure all critical datasets have documented ownership, lineage, and classification.

Microsoft Fabric & Lakehouse

  • Build and manage Lakehouses, Warehouses, and Dataflows Gen2 within the Microsoft Fabric ecosystem.

  • Configure OneLake, shortcuts, and mirroring to unify data across sources without unnecessary duplication.

  • Leverage Fabric Notebooks (PySpark / Python) and Spark job definitions for large-scale data processing.

  • Support the semantic model layer in Fabric to ensure Power BI datasets are optimized and governed.

Power BI & Reporting

  • Develop and maintain Power BI semantic models (star schema design, DAX measures, row-level security).

  • Build production-grade dashboards and reports for business stakeholders; ensure refresh reliability and performance.

  • Apply Copilot-assisted authoring in Power BI and Fabric where applicable to accelerate report generation.

  • Support self-service analytics adoption by publishing governed datasets to the Power BI service.

Collaboration & AI Enablement

  • Partner closely with the AI Engineers, peer data scientists, and analytics team members to supply clean, structured data for RAG pipelines, model training, and agentic workflows.

  • Contribute to the design of shared data contracts and API schemas between data engineering and AI engineering layers.

  • Assist with AI-assisted data tasks using Microsoft Copilot (in Fabric, Power BI, and Azure environments).

What you need to bring:

Qualifications

  • Bachelor's or Master's degree in Computer Science, Information Systems, Data Engineering, Mathematics, or a related discipline.

  • 4 – 5 years of hands-on experience in data engineering, ETL development, or analytics engineering roles.

  • Demonstrable experience with Databricks and/or Microsoft Fabric in a production environment.

  • Proficiency in Power BI report and semantic model development.

  • Exposure to Collibra or equivalent data governance / cataloguing platforms is strongly preferred.

  • Strong SQL and Python skills; PySpark experience is required.

  • Familiarity with Azure cloud services and DevOps practices for data pipeline deployment.

Technical Skill Requirements

  • Data Platforms - Databricks (PySpark, Delta Lake, Delta Live Tables, Unity Catalog), Microsoft Fabric (Lakehouse, Warehouse, Dataflows Gen2, Notebooks), Azure Data Lake Storage Gen2

  • Data Transformation - PySpark, SQL, dbt (data build tool), Azure Data Factory, Fabric Data Factory; Medallion Architecture, SCD types, incremental load patterns

  • Data Modelling - Star schema, snowflake schema, dimensional modelling, data vault concepts; normalization, entity-relationship design, semantic layer design

  • Reporting & BI - Power BI (DAX, semantic models, RLS, Power Query / M), Microsoft Fabric Power BI integration, Copilot-assisted authoring in Power BI

  • Programming - Python (primary), SQL (advanced); PySpark; familiarity with JSON, Parquet, Delta file formats

  • Cloud & DevOps - Azure (preferred): Synapse, ADF, ADLS Gen2, Key Vault; Git/GitHub for version control; CI/CD basics for pipeline deployment

  • Data Governance & Cataloguing - data lineage documentation, metadata management, data classification and tagging, business glossary ownership

  • AI & Copilot Tooling - Microsoft Copilot in Fabric / Power BI; familiarity with AI-assisted data transformation; understanding of LLM data requirements (embeddings, chunking, vector-ready formats)

  • Data Concepts - Data warehousing, lakehouse architecture, OLAP vs OLTP, event-driven ingestion, streaming basics (Structured Streaming / Event Hubs), data contracts, master data management (MDM)

#Financialservices

Additional Skills:

Accountability, Action Planning, Active Learning, Active Listening, Agile Methodology, Agile Scrum Development, Analytical Thinking, Bias, Coaching, Creativity, Critical Thinking, Cross-Functional Teamwork, Data Analysis Management, Data Collection Management, Data Controls, Design, Design Thinking, Empathy, Follow-Through, Group Problem Solving, Growth Mindset, Intellectual Curiosity, Long Term Planning, Managing Ambiguity, and 5 more.

What We Can Offer You:

Health & Wellbeing

We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.

Personal & Professional Development

We also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have — whether you want to become a knowledge expert in your field or apply your skills to another division.

Unconditional Inclusion

We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.

Let's Stay Connected:

Follow @HPECareers on Instagram to see the latest on people, culture and tech at HPE.

#india

Job: Engineering

Job Level: TCP_03


HPE is an Equal Employment Opportunity/Veterans/Disabled/LGBT employer. We do not discriminate on the basis of race, gender, or any other protected category, and all decisions we make are made on the basis of qualifications, merit, and business need. Our goal is to be one global team that is representative of our customers, in an inclusive environment where we can continue to innovate and grow together. Please click here: Equal Employment Opportunity.

Hewlett Packard Enterprise is EEO Protected Veteran/Individual with Disabilities.


HPE will comply with all applicable laws related to employer use of arrest and conviction records, including laws requiring employers to consider for employment qualified applicants with criminal histories.


Recruitment Fraud Alert

We have become aware of an increase in fraudulent recruitment activities in which individuals impersonate our company or authorized recruitment agencies to offer fake employment opportunities. These scams may occur through false websites, emails, social media, or chat-based applications and often aim to obtain personal information or money. Please note that Hewlett Packard Enterprise (HPE), its direct and indirect subsidiaries and affiliated companies, and its authorized recruitment agencies/vendors will never charge a candidate a registration fee, hiring fee, or any other fee in connection with its recruitment and hiring process. We also never request personal information such as bank account details, Social Security numbers, or national IDs via social media or chat applications.

All legitimate job opportunities will come through official company channels, and candidates are responsible for verifying the credentials of any third party claiming to represent the company. Any reliance on fraudulent communication is at the individual’s own risk, and HPE disclaims legal liability for any resulting damages. If you suspect recruitment fraud, do not share personal information or make any payments and report the incident to your local authorities immediately.

