Azure Databricks with Anaplan

Capgemini
Atlanta, GA
$80,420 - $106,050
Hybrid

About The Position

Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues around the world, and where you’ll be able to reimagine what’s possible. Join us and help the world’s leading organizations unlock the value of technology and build a more sustainable, more inclusive world.

Location

The job is located in Charlotte, NC - Onsite/Hybrid.

Your Role

We are looking for an experienced Azure Databricks Engineer with strong expertise in cloud-based data engineering, ETL development, and distributed data processing. The ideal candidate should have solid hands-on experience with PySpark, Delta Lake, and Azure Data Factory, and with building scalable data pipelines on Azure. The engineer will work closely with the business, Data Architects, and cross-functional teams to design, develop, and optimize data pipelines for enterprise-grade analytics and reporting.

Requirements

  • Azure Databricks: notebooks, jobs, workflows, Delta Lake
  • PySpark: DataFrames, Spark SQL, optimization, debugging
  • Azure Data Factory (ADF): triggers, pipelines, integration runtime
  • Data Lake Storage (ADLS Gen2): folder structures, partitioning, security
  • CI/CD: Git branching strategies, Azure DevOps pipelines
  • SQL: strong proficiency in writing optimized queries

Responsibilities

Data Engineering and Pipeline Development
  • Design, develop, and optimize ETL/ELT pipelines using Azure Databricks and PySpark
  • Build scalable data ingestion workflows from various structured and unstructured sources
  • Implement transformation logic, data cleansing, enrichment, and validation frameworks
  • Work with Delta Lake to build a medallion architecture (Bronze/Silver/Gold layers)
  • Develop reusable Databricks notebooks and jobs for production data workflows
  • Build and orchestrate pipelines using Azure Data Factory (ADF)
  • Integrate Databricks with other Azure services: ADLS, Azure SQL, Event Hub, Key Vault, Synapse
  • Optimize compute environments: clusters, pools, autoscaling
  • Implement DevOps processes using Git, CI/CD, and Azure DevOps
  • Optimize PySpark jobs for performance and cost efficiency
  • Implement best practices for data governance, security, and access control
  • Troubleshoot production issues and perform root-cause analysis
  • Conduct code reviews, ensuring coding standards and data quality

Collaboration and Documentation
  • Work with Data Architects to define architecture and design patterns
  • Prepare technical documents, solution diagrams, and runbooks
  • Collaborate with business stakeholders to understand requirements and translate them into technical solutions

Benefits

  • Paid time off based on employee grade (A-F), as defined by policy: vacation (12-25 days, depending on grade), company-paid holidays, personal days, and sick leave
  • Medical, dental, and vision coverage (or provincial healthcare coordination in Canada)
  • Retirement savings plans (e.g., 401(k) in the U.S., RRSP in Canada)
  • Life and disability insurance
  • Employee assistance programs
  • Other benefits as provided by local policy and eligibility