Observability Architect

GeotabAtlanta, GA
Hybrid

About The Position

Geotab ® is a global leader in IoT and connected transportation and certified “Great Place to Work™.” We are a company of diverse and talented individuals who work together to help businesses grow and succeed, and increase the safety and sustainability of our communities. Geotab is advancing security, connecting commercial vehicles to the internet and providing web-based analytics to help customers better manage their fleets. Geotab’s open platform and Geotab Marketplace ®, offering hundreds of third-party solution options, allows both small and large businesses to automate operations by integrating vehicle data with their other data assets. Processing billions of data points a day, Geotab leverages data analytics and machine learning to improve productivity, optimize fleets through the reduction of fuel consumption, enhance driver safety and achieve strong compliance to regulatory changes. Our team is growing and we’re looking for people who follow their passion, think differently and want to make an impact. Ours is a fast paced, ever changing environment. Geotabbers accept that challenge and are willing to take on new tasks and activities - ones that may not always be described in the initial job description. Join us for a fulfilling career with opportunities to innovate, great benefits, and our fun and inclusive work culture. Reach your full potential with Geotab. To see what it’s like to be a Geotabber, check out our blog and follow us @InsideGeotab on Instagram. Join our talent network to learn more about job opportunities and company news. We are always looking for amazing talent who can contribute to our growth and deliver results! Geotab is seeking an SRE Observability Architect who will define the strategic vision, technical architecture, and engineering standards for observability across the organization's cloud platforms. The projects will vary in scope, complexity, and affected business area. If you love technology, and are keen to join an industry leader — we would love to hear from you! As an SRE Observability Architect, your key area of responsibility will be defining the foundational observability architecture that enables scalable, cost-efficient, and highly reliable insight into distributed systems while leading the design of next-generation observability platforms. You will need to work closely with SRE, platform engineering, and application development teams, as well as align with security and compliance stakeholders. To be successful in this role, you will be a strong analytical and systems thinker with the ability to navigate complex, ambiguous technical challenges and articulate technical architecture to executive audiences. In addition, the successful candidate will have deep expertise in designing enterprise-scale observability platforms and the ability to influence and drive technical direction across multiple teams and organizational boundaries.

Requirements

  • 5-8 years of experience in Observability Architecture, Site Reliability Engineering (SRE), or Platform/Infrastructure Engineering.
  • Post-secondary Diploma/Degree in Engineering, Computer Science, or a related field.
  • Mastery of the OpenTelemetry ecosystem and expert-level knowledge of Prometheus-compatible metrics systems (VictoriaMetrics, Thanos, etc.).
  • Advanced experience with tracing systems (Grafana Tempo, Jaeger) and log aggregation platforms (Loki, Elasticsearch, Google BigQuery).
  • Expert-level proficiency in cloud infrastructure (GCP strongly preferred) and Kubernetes architecture.
  • Strong software engineering skills in Go, Python, or similar languages for building cloud-native tooling.
  • Excellent communication skills with the ability to influence technical direction across organizational boundaries.

Nice To Haves

  • Preferred certifications: Google Cloud Professional Cloud Architect or Certified Kubernetes Administrator (CKA).

Responsibilities

  • Define and own the enterprise-wide observability architecture, establishing technical standards, reference architectures, and multi-year roadmaps.
  • Evaluate, select, and standardize observability tooling (e.g., Grafana, Prometheus, VictoriaMetrics, Tempo, Loki, Elastic Stack, OpenTelemetry) to reduce tool sprawl and optimize total cost of ownership.
  • Design scalable data pipelines and storage strategies capable of ingesting and querying petabyte-scale telemetry data across metrics, traces, logs, and continuous profiling.
  • Design Terraform modules and Helm charts for declarative observability infrastructure provisioning across multi-cloud environments.
  • Establish and enforce instrumentation standards using the OpenTelemetry framework, including SDK guidelines, collector deployment patterns, and semantic conventions.
  • Define and champion SLO/SLI/error-budget frameworks across engineering teams, providing architectural guidance on service-level objective implementation.
  • Serve as a senior escalation point during critical incidents, leveraging deep observability expertise to accelerate diagnosis and resolution.
  • Provide architectural mentorship and technical guidance to Observability Engineers and SRE team members.

Benefits

  • Flex working arrangements
  • Home office reimbursement program
  • Baby bonus & parental leave top up program
  • Online learning and networking opportunities
  • Electric vehicle purchase incentive program
  • Competitive medical and dental benefits
  • Retirement savings program
  • The above are offered to full-time permanent employees only

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

Associate degree

Number of Employees

501-1,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service