Skyworks-posted 4 months ago
$126,500 - $241,700/Yr
Full-time • Senior
Irvine, CA
Computer and Electronic Product Manufacturing

We are seeking a highly experienced Principal Data Engineer to lead the design and implementation of scalable, high-performance data solutions. This role will focus on cloud-native architectures using Azure, Databricks, Kafka, Spark, and other modern data technologies. You will play a key role in shaping our data strategy, mentoring engineering teams, and driving innovation across our data platforms.

  • Architect and implement scalable data pipelines using Apache Spark, Databricks, and Azure Data Factory.
  • Lead the development of real-time streaming solutions using Apache Kafka.
  • Design and optimize ETL/ELT workflows for structured and unstructured data.
  • Build and maintain distributed data systems using Cassandra, Delta Lake, and other modern data stores.
  • Utilize Delta Live Tables (DLT) to create reliable, maintainable, and testable batch and streaming pipelines.
  • Integrate Databricks with Azure Machine Learning, Azure Synapse, and other cloud services.
  • Implement CI/CD pipelines using Azure DevOps, Terraform.
  • Collaborate with data scientists to deploy and manage ML models using MLflow.
  • Ensure data quality, governance, and security across all engineering efforts.
  • Troubleshoot and resolve issues in data models, workflows, and infrastructure.
  • Design and maintain data models for cloud data warehouses such as Snowflake or Databricks.
  • Apply advanced techniques like data partitioning, indexing, and compression to optimize performance and storage.
  • Develop disaster recovery plans and backup strategies to ensure business continuity.
  • Mentor junior engineers and foster a culture of technical excellence and innovation.
  • Stay current with emerging technologies and recommend strategic adoption where appropriate.
  • Collaborate with global Agile teams to deliver high-quality solutions.
  • Bachelor's degree and 12+ years of experience in data engineering, with at least 3 years in a principal or lead role.
  • Expertise in Azure, Databricks, Apache Spark, Kafka, and Cassandra.
  • Strong programming skills in Python, SQL, and Scala.
  • Experience with distributed systems, data modeling, and data warehousing.
  • Familiarity with machine learning pipelines, MLOps, and cloud-native architectures.
  • Proven ability to lead cross-functional teams and deliver complex data solutions.
  • Excellent communication, problem-solving, and leadership skills.
  • Exposure to big data tools and distributed computing.
  • Certifications in Azure or Databricks are a plus.
  • Access to healthcare benefits (including a premium-free medical plan option)
  • 401(k) plan and company match
  • Employee stock purchase plan
  • Paid time off (including vacation, sick/wellness, parental leave)
  • Eligibility to participate in an incentive plan
  • Potential for additional awards, including recognition and stock based on performance
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service