Data Engineer

Career Mentors, LLC
Plano, TX
Onsite

About The Position

We are hiring a Data Engineer to join the Card Risk Platform (CCB) team. The team supports and maintains machine learning models in production, with a focus on modernizing and enhancing PySpark-based data pipelines in AWS. You will convert existing models (Python/Scala) to PySpark, improve Spark performance, and support large-scale data processing in AWS EMR environments.

Requirements

  • Strong Python / PySpark
  • Apache Spark
  • Advanced SQL
  • AWS experience
  • MapReduce (EMR experience preferred)
  • Experience supporting production data/ML pipelines

Nice To Haves

  • Databricks (Lakeview, MLflow)
  • Terraform
  • Scala

Responsibilities

  • Maintain and enhance ML models in production
  • Convert existing Python/Scala models to PySpark in AWS
  • Optimize and enhance PySpark/Spark code
  • Work with large-scale distributed data systems
  • Support MapReduce/EMR-based processing workflows