Google Cloud Lead Data Engineer (GCP)
Capgemini · Nashville, Tennessee, US
Job Description . We are seeking a skilled Data Engineer with hands-on experience in Databricks and Google Cloud Platform (GCP) to design, build, and optimiz...
Job description
Job Description . We are seeking a skilled Data Engineer with hands-on experience in Databricks and Google Cloud Platform (GCP) to design, build, and optimize data pipelines and analytics solutions. The ideal candidate will have a strong background in distributed data processing, cloud architecture, and data modeling. This role partners closely with data analysts, data scientists, and business stakeholders to deliver scalable, reliable, and high‑quality data products. Your role: Design, build, and maintain ETL/ELT pipelines using Databricks (PySpark, Delta Lake). Optimize pipelines for performance, cost efficiency, and scalability within GCP. Develop batch and streaming data processes using Spark Streaming, and related technologies. Implement data solutions leveraging GCP services such as BigQuery, Cloud Storage, Dataflow, Cloud Composer, and Vertex AI integrations. Apply best practices for cloud security, IAM configuration, monitoring, and cost management. Build and maintain data models, including dimensional modeling and data vault structures. Implement data quality frameworks, validation rules, and automated testing. Manage data versioning, governance, and lineage using tools su...