SRE Engineer

Stellantis•Auburn Hills, MI

55d•Remote

About The Position

Mobilisights is a Data-as-a-Service (DaaS) business unit of Stellantis, unlocking real-time insights from millions of connected vehicles worldwide. We combine the scale of a global automotive leader with the agility of a startup—fully remote, fast-moving, and impact-driven. We are looking for a self-driven Site Reliability Engineer (SRE) based in North America to help monitor, operate, and improve our cloud and data platforms. This role requires a high degree of independence and ownership, as you will often be the primary SRE during NA coverage hours, collaborating asynchronously with teams in Europe and India.

Requirements

Bachelor’s degree in Computer Science, Computer Engineering, Electrical Engineering, or a related field
A minimum of 3 years of experience in SRE, DevOps, or Cloud Operations
Hands-on experience managing AWS-based infrastructure
Strong experience with Terraform and infrastructure-as-code
Practical experience with Grafana and Prometheus
Automation mindset using Python, Bash, PowerShell, or AWS CLI
Experience participating in on-call rotations and production support
Working knowledge of CI/CD pipelines (GitHub Actions, GitLab, Jenkins, etc.)
Understanding of cloud security best practices and incident response
Highly self-motivated and proactive; comfortable working independently
Strong sense of ownership and accountability
Clear and effective remote communication skills
Calm, structured approach to incident handling
Able to collaborate effectively across global teams and time zones

Nice To Haves

Experience supporting data platforms or Data-as-a-Service (DaaS) products
Exposure to streaming and event-driven systems (Kafka, Kinesis, SQS, etc.)
Experience working with high-volume, real-time telemetry or IoT data
Familiarity with data pipelines (batch and streaming) and data reliability concepts
Experience with big data technologies (Spark, Flink, Hadoop, Iceberg, Delta Lake, Databricks, etc.)
Basic understanding of data quality, data latency, and data availability SLIs/SLOs
Experience operating Kubernetes or containerized workloads in cloud environments
Understanding of cost optimization for large-scale data ingestion and storage in AWS
Prior experience in automotive, mobility, IoT, or connected devices ecosystems

Responsibilities

Monitor, operate, and support cloud and data platforms during NA coverage hours
Participate in a 24×7 on-call SRE rotation using a follow-the-sun model
Troubleshoot and resolve production incidents independently; lead incident response when required
Monitor availability, latency, and system health using Grafana and Prometheus
Define and track SLIs and SLOs to improve service reliability
Drive blameless postmortems and ensure permanent incident remediation
Build and maintain infrastructure using Terraform and automation scripts
Act as a hands-on contributor to AWS infrastructure (VPC, EC2, S3, IAM, RDS, ELB, Route53)
Continuously improve reliability, scalability, security, and cost efficiency

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume