GCP /Python Data Engineer

CapgeminiAtlanta, GA
$80,420 - $106,050Hybrid

About The Position

GCP Python Lead Data EngineerWe are seeking a highly skilled GCP Data Engineer with strong Python expertise to design, build, and optimize scalable data solutions on Google Cloud Platform (GCP). The ideal candidate will have hands-on experience developing batch and real-time data pipelines, working with large-scale datasets, and enabling analytics and AI/ML use cases.

Requirements

  • GCP Services: BigQuery, Dataflow, Dataproc, Pub/Sub, Cloud Functions (Gen2), Cloud Composer (Airflow), Cloud Storage, Cloud SQL
  • Programming: Python (advanced)
  • Query Language: SQL (advanced)
  • Data Processing: Batch & Streaming architectures
  • Scripting: Bash/Shell scripting
  • Concepts: Data Warehousing, ETL/ELT, Data Lake / Lakehouse architectures
  • 8+ years of overall data engineering or software engineering experience
  • 3+ years of hands-on Google Cloud Platform experience
  • 3+ years of Python development
  • 3+ years of experience building data pipelines (batch and streaming)

Nice To Haves

  • Experience with Dataproc (Spark/PySpark) for large-scale processing
  • Familiarity with event-driven architectures
  • Knowledge of Terraform or Infrastructure as Code
  • Understanding of cost optimization (FinOps)
  • Google Cloud Professional Data Engineer Certification
  • Experience supporting AI/ML data pipelines

Responsibilities

  • Design, build, and maintain scalable batch and real-time data pipelines using GCP services such as Dataflow, Dataproc, and Pub/Sub
  • Develop and optimize ETL/ELT workflows for structured and unstructured data processing
  • Implement event-driven data processing using Cloud Functions and Pub/Sub
  • Build and manage data ingestion frameworks for streaming and batch data sources
  • Design and optimize data lakes and data warehouses using BigQuery and Cloud Storage
  • Develop efficient data models to support analytics, reporting, and machine learning workloads
  • Optimize performance and cost of data pipelines and queries
  • Develop reusable and scalable solutions using Python
  • Automate workflows and orchestration using Cloud Composer (Airflow)
  • Implement CI/CD pipelines and deployment automation
  • Collaborate with analytics, AI/ML, and business teams for data consumption needs
  • Troubleshoot data issues and perform root cause analysis
  • Continuously improve pipeline reliability, scalability, and performance

Benefits

  • Paid time off based on employee grade (A-F), defined by policy: Vacation: 12-25 days, depending on grade, Company paid holidays, Personal Days, Sick Leave
  • Medical, dental, and vision coverage (or provincial healthcare coordination in Canada)
  • Retirement savings plans (e.g., 401(k) in the U.S., RRSP in Canada)
  • Life and disability insurance
  • Employee assistance programs
  • Other benefits as provided by local policy and eligibility
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service