Consumer Reports-posted 15 days ago
$100,000 - $120,000/Yr
Full-time • Mid Level
Hybrid • Yonkers, NY
501-1,000 employees

Data powers everything we do at CR—and it’s the foundation for our AI and machine learning efforts that are transforming how we serve consumers. The Data Engineer ( AI/ML & Data Science) will play a critical role in building the data infrastructure that powers advanced AI applications, machine learning models, and analytics systems across CR. Reporting to the Associate Director, AI/M & Data Science, in this role, you will design and maintain robust data pipelines and services that support experimentation, model training, and AI application deployment. If you’re passionate about solving complex data challenges, working with cutting-edge AI technologies, and enabling impactful, data-driven products that support CR’s mission, this is the role for you. This is a hybrid position. This position is not eligible for sponsorship or relocation assistance. As a mission based organization, CR and our Software team are pursuing an AI strategy that will drive value for our customers, give our employees superpowers, and address AI harms in the digital marketplace. We’re looking for an AI/ML engineer to help us execute on our multi-year roadmap around generative AI.

  • Design, develop, and maintain ETL/ELT pipelines for structured and unstructured data to support AI/ML model and application development, evaluation, and monitoring.
  • Build and optimize data processing workflows in Databricks, AWS SageMaker, or similar cloud platforms.
  • Collaborate with AI/ML engineers to deliver clean, reliable datasets for model training and inference.
  • Implement data quality, observability, and lineage tracking within the ML lifecycle.
  • Develop Data APIs/microservices to power AI applications and reporting/analytics dashboards.
  • Support the deployment of AI/ML applications by building and maintaining feature stores and data pipelines optimized for production workloads.
  • Ensure adherence to CR’s data governance, security, and compliance standards across all AI and data workflows.
  • Work with Product, Engineering and other stakeholders to define project requirements and deliverables.
  • Integrate data from multiple internal and external systems, including APIs, third-party datasets, and cloud storage.
  • You have 3+ years of experience designing and developing data pipelines, data models/schemas, APIs, or services for analytics or ML workloads.
  • You’ve earned a Bachelor’s degree in Computer Science, Engineering, or a related field.
  • You are skilled in Python, SQL, and have experience with PySpark on large-scale datasets.
  • You have experience with data orchestration tools such as Airflow, dbt and Prefect, plus CI/CD pipelines for data delivery.
  • You have experience with Data and AI/ML platforms such as Databricks, AWS SageMaker or similar.
  • You have experience working with Kubernetes on cloud platforms like - AWS, GCP, or Azure.
  • You are passionate about automation and continuous improvement.
  • You have excellent documentation and technical communication skills.
  • You are an analytical thinker with troubleshooting abilities.
  • You are self-driven and proactive in solving infrastructure bottlenecks.
  • We offer medical benefits that start on your first day as a CR employee that include behavioral health coverage, family planning and a generous 401K match.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service