MCPNew: now works with Claude & AI assistants
aptean

aptean

Data Engineer

Company

aptean

Role

Data Engineer

Job type

Full-time

Found on Mokaru

1 week ago

Share this job

Salary

Not disclosed by employer

Job description

Overview

JOB DESCRIPTION

  • JOB IDENTIFICATION

APTEAN JOB TABLE

APTEAN JOB LEVEL: B

APTEAN JOB TITLE: Data Engineer

REPORTS TO Full line: Kshitij Bhatia Dotted line

  • GENERAL JOB SUMMARY

Aptean is seeking a hands-on, results-driven Data Engineer to design, build, and maintain scalable Enterprise data lakehouse using Microsoft Fabric , following the medallion architecture . This role is key to advancing our enterprise data platform, enabling analytics, Enterprise reporting, and AI-driven insights. The ideal candidate will have strong experience in data modeling, data lakehouse technologies , and a passion for leveraging AI to optimize and enrich data processes wherever possible.

  • SCOPE
  • ORGANIZATION (Indicate the job positions reporting to this role)

Full line

· Job (# of positions)

Dotted line

· Job (# of positions)

  • PRINCIPAL DUTIES AND RESPONSIBILITIES
  • Design and build robust, scalable data ingestion pipelines using Microsoft Fabric (Pipelines, Dataflows, Notebooks) to integrate data from Business Applications, databases, files, and APIs into the Lakehouse.
  • Perform deep source system analysis to define ingestion strategies that ensure data reliability, consistency, and observability, while applying metadata-driven design for automation.
  • Develop and maintain Delta Tables using the medallion architecture (bronze/silver/gold) to systematically cleanse, enrich, and standardize data for downstream consumption.
  • Implement comprehensive data quality checks (nulls, duplicates, schema drift, outliers, SCD types) and ensure data integrity across all transformation layers in the Lakehouse.
  • Apply governance practices including schema versioning, data lineage tracking, role-based access control (RBAC), and audit trails to ensure compliance, traceability, and secure data access.
  • Structure the gold layer and semantic model to support AI/ML use cases , ensuring datasets are enriched, contextualized, and optimized for AI agent consumption.
  • Develop and maintain AI-ready run flows and access patterns to enable seamless integration between the Lakehouse and AI agents for tasks such as prediction, summarization, and decision automation.
  • Leverage DevOps best practices for pipeline versioning, testing, deployment, and monitoring; proactively detect and resolve data integration and processing issues.
  • JOB SPECIFICATIONS

Education (Indicate the minimum level of education necessary for this position. Check all that apply and indicate specific degree as applicable to the side (e.g., Bachelor’s in Computer Science)

Required Preferred Degree/Certification

☒ ☐ Bachelor’s degree

☐ ☒ Master’s degree

☐ ☐ Ph.D.

☐ ☐ J.D. (law)

☐ ☐ Certification

☐ ☐ Registration

☐ ☐ Licensure

☐ ☐ Other

Work Experience

2-5 Year’s

Knowledge, Skills, Abilities & Competencies

  • Hands-on experience in designing and implementing data platforms, including data warehouses, lakehouse , and modern ETL/ELT pipelines .

· Working knowledge of Microsoft Fabric (Pipelines, Dataflows Gen2, Notebooks, Lakehouse) is strongly preferred.

  • Proven ability to build, deploy, and troubleshoot highly reliable, distributed data pipelines integrating structured and unstructured data from various internal systems and external sources.

· Working knowledge of the medallion architecture (bronze/silver/gold) and Delta Lake / OneLake concepts; prior project experience implementing this pattern is highly desirable.

  • Solid understanding of data lakehouse patterns and Delta Lake / OneLake concepts, with the ability to structure data models that are AI/ML-ready and support semantic modeling.
  • Solid understanding of SQL, relational and dimensional data modeling, query tuning, and basic performance optimization.
  • Familiarity with data quality concepts — null/duplicate/schema-drift checks, basic SCD handling, and validation rules in transformation pipelines.
  • Familiarity with Git, branching strategies, and CI/CD concepts in a data engineering context (Azure DevOps or GitHub Actions).
  • Strong communication and collaboration skills, with the ability to articulate complex data engineering solutions to both technical and non-technical stakeholders, and to lead cross-functional initiatives.

Shift details

Required to work in shift: NO

If Yes Shift Timing: N/A

DISCLAIMER

The preceding job description has been designed to indicate the general nature and level of work performed by employees within this classification. It is not designed to contain or be interpreted as a comprehensive inventory of all duties, responsibilities and qualifications required of employees assigned to this job.

Resume ExampleCover Letter Example

Explore more