Senior Data Scientist

Caterpillar Inc.
Mossville, IL
1d
$112,710 - $183,140

About The Position

Your Work Shapes the World at Caterpillar Inc.

When you join Caterpillar, you're joining a global team who cares not just about the work we do – but also about each other. We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here – we make it happen, with our customers, where we work and live. Together, we are building a better world, so we can all enjoy living in it.

CATERPILLAR REMANUFACTURING

Do you have a passion for helping customers and supporting a sustainable solution? Since 1973, Cat Reman has helped bring the value at the core of every Cat product back to life. Customers look to us to provide a high-quality solution that is good for their business and promotes sustainability. It's important to us to deliver. Our team is full of industry leaders. Together, following common values, we share a passion for delivering sustainability benefits that help the company contribute to a circular economy. It's rewarding to work with an inspiring team where every contribution matters.

Job Summary

Caterpillar's Remanufacturing Division is looking for a Senior Data Scientist to support the Mossville, IL facility. This position will architect, implement, and maintain robust data platform infrastructure and services to empower data scientists to develop, test, and deploy models with maximum velocity. The role focuses on abstracting underlying infrastructure complexities and conducting data science or engineering initiatives to validate platform functionality and support data-driven business decisions.

Degree Requirement: Degree or equivalent experience is desired.

Requirements

  • Cloud and Container Orchestration: Knowledge of cloud providers (AWS/GCP/Azure) and container management; ability to design and manage production-grade Kubernetes clusters.
  • Level: Extensive Experience. Designs and implements auto-scaling, self-healing infrastructure. Manages complex networking, storage, and security policies within Kubernetes. Optimizes cloud spend and resource allocation for high-compute data workloads. Troubleshoots complex distributed system failures across Linux environments.
  • System Programming and API Development: Proficiency in Python and modern frameworks like FastAPI; ability to build performant, well-documented services.
  • Level: Extensive Experience. Architects asynchronous, scalable APIs to support data ingestion and model inference. Implements robust error handling, logging, and telemetry across platform services. Optimizes Python code for performance-critical components of the data pipeline. Enforces strict typing and coding standards to ensure long-term maintainability.
  • DevOps and Automation: Knowledge of CI/CD tools and automation strategies; ability to treat infrastructure as software.
  • Level: Extensive Experience. Builds and maintains automated pipelines for testing, building, and deploying ML models. Utilizes Terraform or similar tools to manage infrastructure state. Implements comprehensive monitoring and alerting for platform health. Drives the "Platform-as-a-Service" internal culture to reduce developer friction.
  • Accuracy and Attention to Detail: Understanding the necessity and value of accuracy; ability to complete tasks with high levels of precision.
  • Level: Extensive Experience. Evaluates and makes contributions to best practices for system reliability. Processes large quantities of configuration and infrastructure data with high levels of accuracy. Productively balances system performance with security and stability.

Responsibilities

  • Directing the design, implementation, and maintenance of the data platform infrastructure; creating scalable and resilient environments that allow data scientists to focus on data rather than underlying resources.
  • Developing high-performance features and APIs using Python and FastAPI; providing the tooling necessary for seamless data science workflows and model serving.
  • Executing end-to-end data science and engineering projects to validate platform capabilities; ensuring technical features align with practical research and production requirements through direct implementation.
  • Leading the orchestration of workloads via Kubernetes and Linux-based systems; promoting efficiency through containerization and advanced resource management.
  • Automating CI/CD pipelines and deployment processes for data projects; implementing "infrastructure as code" to ensure consistency across cloud environments.
  • Ensuring the security, compliance, and monitoring of the platform; conducting regular system audits to maintain high availability and operational integrity.
  • Partnering with data scientists to identify platform bottlenecks; identifying and implementing features that abstract infrastructure complexity to drive research impact.

Benefits

  • Medical, dental, and vision coverage
  • Paid time off plan (Vacation, Holiday, Volunteer, etc.)
  • 401k savings plan
  • Health savings account (HSA)
  • Flexible spending accounts (FSAs)
  • Short and long-term disability coverage
  • Life Insurance
  • Paid parental leave
  • Healthy Lifestyle Programs
  • Employee Assistance Programs
  • Voluntary Benefits (e.g., Accident, Identity Theft Protection)
  • Incentive bonus
  • Adoption benefits
  • Tuition Reimbursement