SRE Engineer

Stellantis, Auburn Hills, MI
Remote (posted 1d ago)

About The Position

Mobilisights is a Data-as-a-Service (DaaS) business unit of Stellantis, unlocking real-time insights from millions of connected vehicles worldwide. We combine the scale of a global automotive leader with the agility of a startup: fully remote, fast-moving, and impact-driven.

We are looking for a self-driven Site Reliability Engineer (SRE) based in North America to help monitor, operate, and improve our cloud and data platforms. This role requires a high degree of independence and ownership, as you will often be the primary SRE during NA coverage hours, collaborating asynchronously with teams in Europe and India.

Requirements

  • Bachelor’s degree in Computer Science, Computer Engineering, Electrical Engineering, or a related field
  • A minimum of 3 years of experience in SRE, DevOps, or Cloud Operations
  • Hands-on experience managing AWS-based infrastructure
  • Strong experience with Terraform and infrastructure-as-code
  • Practical experience with Grafana and Prometheus
  • Automation mindset, with scripting experience in Python, Bash, PowerShell, or the AWS CLI
  • Experience participating in on-call rotations and production support
  • Working knowledge of CI/CD pipelines (GitHub Actions, GitLab, Jenkins, etc.)
  • Understanding of cloud security best practices and incident response
  • Highly self-motivated and proactive; comfortable working independently
  • Strong sense of ownership and accountability
  • Clear and effective remote communication skills
  • Calm, structured approach to incident handling
  • Able to collaborate effectively across global teams and time zones

Nice To Haves

  • Experience supporting data platforms or Data-as-a-Service (DaaS) products
  • Exposure to streaming and event-driven systems (Kafka, Kinesis, SQS, etc.)
  • Experience working with high-volume, real-time telemetry or IoT data
  • Familiarity with data pipelines (batch and streaming) and data reliability concepts
  • Experience with big data technologies (Spark, Flink, Hadoop, Iceberg, Delta Lake, Databricks, etc.)
  • Basic understanding of data quality, data latency, and data availability SLIs/SLOs
  • Experience operating Kubernetes or containerized workloads in cloud environments
  • Understanding of cost optimization for large-scale data ingestion and storage in AWS
  • Prior experience in automotive, mobility, IoT, or connected devices ecosystems

Responsibilities

  • Monitor, operate, and support cloud and data platforms during NA coverage hours
  • Participate in a 24×7 on-call SRE rotation using a follow-the-sun model
  • Troubleshoot and resolve production incidents independently; lead incident response when required
  • Monitor availability, latency, and system health using Grafana and Prometheus
  • Define and track SLIs and SLOs to improve service reliability
  • Drive blameless postmortems and ensure permanent incident remediation
  • Build and maintain infrastructure using Terraform and automation scripts
  • Act as a hands-on contributor to AWS infrastructure (VPC, EC2, S3, IAM, RDS, ELB, Route 53)
  • Continuously improve reliability, scalability, security, and cost efficiency