Jobs via Dice
Data Engineer with Pyspark
Salary
-
Job type
Full-time
Location
Rocky Hill, Connecticut, US
Remote
No
Posted
Yesterday
Resume Examples
Browse professional resume examples with key skills, action verbs, and ATS-friendly formatting.
Browse resume examplesJob description
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Hexplora, is seeking the following. Apply via Dice today!
Title: Data Engineer (PySpark Required)
Location: Rocky Hill, CT (Onsite)
Job Summary
We are seeking a skilled Data Engineer with strong experience in PySpark to join our team in Rocky Hill, CT. This is a fully onsite role where you will be responsible for designing, building, and optimizing scalable data pipelines and data processing systems. The ideal candidate is passionate about big data technologies, data architecture, and delivering high-quality data solutions that drive business insights.
Key Responsibilities
- Design, develop, and maintain scalable data pipelines using PySpark
- Build and optimize ETL/ELT workflows for large-scale data processing
- Work with structured and unstructured datasets from multiple sources
- Collaborate with data analysts, data scientists, and business stakeholders to understand data requirements
- Ensure data quality, integrity, and security across all data platforms
- Optimize data processing performance and troubleshoot data-related issues
- Implement best practices for data governance and data lifecycle management
- Maintain documentation for data processes, workflows, and systems
Required Qualifications
- Bachelor''s degree in Computer Science, Information Technology, or related field
- Hands on experience as a Data Engineer .
- Strong hands-on experience with PySpark (required)
- Proficiency in Python and SQL
- Experience working with big data technologies (e.g., Apache Spark, Hadoop ecosystem)
- Experience with cloud platforms such as AWS, Azure, or Google Cloud
- Familiarity with data warehousing concepts and tools
- Strong problem-solving and analytical skills
Infowave Systems is an equal opportunity employer that is committed to diversity and inclusion in the workplace.
Responsibilities
- This is a fully onsite role where you will be responsible for designing, building, and optimizing scalable data pipelines and data processing systems
- The ideal candidate is passionate about big data technologies, data architecture, and delivering high-quality data solutions that drive business insights
- Design, develop, and maintain scalable data pipelines using PySpark
- Build and optimize ETL/ELT workflows for large-scale data processing
- Work with structured and unstructured datasets from multiple sources
- Collaborate with data analysts, data scientists, and business stakeholders to understand data requirements
- Ensure data quality, integrity, and security across all data platforms
- Optimize data processing performance and troubleshoot data-related issues
- Implement best practices for data governance and data lifecycle management
- Maintain documentation for data processes, workflows, and systems
Qualifications
- Bachelor''s degree in Computer Science, Information Technology, or related field
- Hands on experience as a Data Engineer
- Strong hands-on experience with PySpark (required)
- Proficiency in Python and SQL
- Experience working with big data technologies (e.g., Apache Spark, Hadoop ecosystem)
- Experience with cloud platforms such as AWS, Azure, or Google Cloud
- Familiarity with data warehousing concepts and tools
- Strong problem-solving and analytical skills
Stand out from other applicants
AI reads this job description and tailors your resume to match, optimized for ATS filters.
Similar jobs
Ready to land your next role?
Join thousands of professionals who use Mokaru to manage their job search. AI-powered resume tailoring, application tracking, and more.
Create Free Resume