JobMesh

Pyspark Data Engineer with Databricks

Capgemini · New York City, New York, US

Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by...

Job description

Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues around the world, and where you’ll be able to reimagine what’s possible. Join us and help the world’s leading organizations unlock the value of technology and build a more sustainable, more inclusive world. Job Location : New York, NY Job Description: We are looking for a hands-on mid–senior level PySpark Data Engineer with Databricks who can design, build, and own production-grade data pipelines and platform components. This role requires strong expertise in Python/PySpark, Databricks, and Snowflake, with a focus on building scalable, cost‑efficient, and reliable data systems that support both analytics and machine learning use cases. Key Responsibilities: - Design, develop, and maintain end‑to‑end ETL/ELT pipelines using Python and PySpark on Databricks . - Optimize Spark jobs for performance, scalability, and cost-efficiency in production environments. - Implement data quality frameworks including validation, reconciliation, and anomaly detection. - Build and manage orchestration work...