Data Engineer - Senior

Independence Pet Group
24dHybrid

About The Position

Clarus is a pioneering pet health technology company on a mission to transform the future of pet healthcare. We are dynamic entrepreneurs improving pet health by providing trusted and transparent information across the entire pet health landscape. Join us in shaping a smarter, healthier future for pets everywhere. As we expand, we are looking for a seasoned Senior Data Engineer, in the data team, who can hit the ground running by building data pipelines across the medallion architecture.

Requirements

  • Bachelor’s degree in computer science, Information Technology, or related field.
  • At least 7 years of experience in building data pipelines using ELT/ETL.
  • Experience working with Databricks, Spark, and Azure data services.
  • Strong coding skills with Python, SQL and PySpark.
  • Experience designing and maintaining config-driven pipelines with support for Change Data Capture (CDC) and event/stream-based processing.
  • Proficiency with NoSQL and SQL databases.
  • Strong understanding of data modeling, schema design, and database performance optimization.
  • Practical experience working with various file formats, including JSON, Parquet, and ORC.
  • Hands-on experience building and maintaining CI/CD pipelines (Azure DevOps, GitLab) and automating data workflow deployments.
  • Solid understanding of data governance, lineage, and cloud security (Unity Catalog, encryption, access control).
  • Excellent problem-solving and analytical skills.
  • Experience with data modeling tools and techniques.
  • Knowledge of cloud-based data solutions and platforms.

Responsibilities

  • Design, develop, and deploy robust and scalable data pipelines using Databricks, integrating data from various sources.
  • Optimize pipeline performance for greater efficiency and cost-effectiveness.
  • Contribute to the design and implementation of robust data models in the data lake environment, ensuring data consistency, integrity and accessibility for various use cases.
  • Implement rigorous data quality controls and validation procedures throughout the data pipeline to ensure high accuracy and reliability.
  • Comply with the data governance policies and best practices.
  • Seamlessly integrate Databricks with Azure cloud services for storage, compute and security, leveraging services like Azure Data Lake Storage.

Benefits

  • Comprehensive full medical, dental and vision Insurance
  • Basic Life Insurance at no cost to the employee
  • Company paid short-term and long-term disability
  • 12 weeks of 100% paid Parental Leave
  • Health Savings Account (HSA)
  • Flexible Spending Accounts (FSA)
  • Retirement savings plan
  • Personal Paid Time Off
  • Paid holidays and company-wide Wellness Day off
  • Paid time off to volunteer at nonprofit organizations
  • Pet friendly office environment
  • Commuter Benefits
  • Group Pet Insurance
  • On the job training and skills development
  • Employee Assistance Program (EAP)
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service