Databricks Platform Architect

CapgeminiAtlanta, GA
Onsite

About The Position

We are seeking an experienced Databricks Platform Architect who can work with our client(s) in defining and delivering end to end analytics and data science solutions on the Databricks Lakehouse Platform. Has experience and extensively worked with Iceberg for open table format workloads and cross-platform interoperability between Databricks, Snowflake and Microsoft Fabric.Has good experience in designing SQL and Spark based Transformations on Databricks, the person needs to be experienced enough to showcase the technical maturity on various aspects of data architecture and advise the customer on a architecture that complements Databricks Iceberg along with interoperability.

Requirements

  • Proven track record of designing and deploying large scale data platforms on Azure and Databricks
  • Strong understanding of data modeling (star/snowflake schemas), ELT/ETL patterns, and data governance.
  • Excellent communication skills: ability to lead technical discussions and present to executive stakeholders.
  • Good Understanding of the DBT based ETL processing
  • Databricks Certified Data Engineer Associate (or higher).
  • Certification in at least one major cloud provider (AWS/Azure/GCP).

Nice To Haves

  • Databricks Certified Data Architect Professional (preferred).

Responsibilities

  • Lead workshops with technical stakeholders to gather requirements and translate them into a comprehensive Databricks Lakehouse architecture with interoperability with snowflake, Databricks on another tenant and Microsoft Fabric.
  • Leverage Photon execution engine to accelerate SQL-based transformations, queries, and high-performance ELT pipelines.
  • Tune SQL and Spark jobs for optimal performance, including partitioning, , Adaptive Query Execution (AQE), broadcast joins, and cluster-level optimization.
  • Improve pipeline performance and cost efficiency through Photon-optimized SQL workloads, cluster sizing, autoscaling, and effective use of SQL Warehouses and Job Clusters.
  • Align workloads to the appropriate Databricks compute types (SQL Warehouses, Photon, All-Purpose Clusters) to balance performance, concurrency, and cost

Benefits

  • Paid time off based on employee grade (A-F), defined by policy: Vacation: 12-25 days, depending on grade, Company paid holidays, Personal Days, Sick Leave
  • Medical, dental, and vision coverage (or provincial healthcare coordination in Canada)
  • Retirement savings plans (e.g., 401(k) in the U.S., RRSP in Canada)
  • Life and disability insurance
  • Employee assistance programs
  • Other benefits as provided by local policy and eligibility
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service