What's the role? The primary responsibility of this role is to lead and scale the Business Intelligence engineering team responsible for delivering reliable, accurate, and actionable business dashboards and data insights across the Foundation Engineering. This role will ensure end-to-end ownership of BI platforms, including data pipelines, semantic layers, reporting tools, and analytics workflows. The Engineering Manager II will manage technical execution, prioritize BI initiatives aligned with business goals, and ensure high data quality, performance, and governance standards. They will partner closely with product, operations, finance, and leadership teams to translate business requirements into scalable analytics solutions and drive data-driven decision-making. Additionally, this role will mentor and develop BI engineers and analysts, improve development best practices, and continuously enhance dashboard usability, self-service analytics, and data accessibility while ensuring system reliability and on-time delivery. Who are you? Proven experience managing high-performing platform or observability engineering teams, with a track record of delivering scalable monitoring, logging, tracing, and alerting solutions under production-critical environments. A strong coach and mentor who builds SRE/Observability maturity across engineering teams and promotes a culture of reliability, ownership, and operational excellence. Hands-on engineering background with expertise in distributed systems, Java or similar backend technologies, and strong knowledge of data stores such as PostgreSQL or time-series databases. Deep experience designing and operating cloud-native observability platforms on AWS (EKS/ECS, Lambda, CloudWatch, S3, OpenSearch, RDS, Aurora). Strong expertise in observability tooling such as Prometheus, Grafana, ELK/OpenSearch stack, Jaeger, OpenTelemetry, and event-streaming platforms like Kafka. Experience designing monitoring strategies for microservices architectures, event-driven systems, REST APIs, and real-time data pipelines. Strong understanding of SRE principles including SLIs, SLOs, error budgets, incident management, root cause analysis, and reliability engineering. Experience leveraging DevOps practices to improve deployment observability, production stability, CI/CD monitoring, and proactive issue detection. Strong Agile practitioner (Scrum/Lean) with a data-driven approach to reliability metrics and operational KPIs. Excellent communication skills with the ability to translate technical telemetry insights into business impact. B.Sc. or higher in Computer Science, Engineering, or related field.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Manager