Vantage Data Centers powers, cools, protects and connects the technology of the world’s well-known hyperscalers, cloud providers and large enterprises. Developing and operating across North America, EMEA and Asia Pacific, Vantage has evolved data center design in innovative ways to deliver dramatic gains in reliability, efficiency and sustainability in flexible environments that can scale as quickly as the market demands.
IT Standards Team
Our team is responsible for helping other technology teams on their automation journey, and for developing IT standards that support the IT organization. We embrace many approaches and technologies to speed up the delivery and operations of our Data Centers. From Zero-Touch provisioning of network equipment to the deployment of applications on containerization platforms, we apply our software and operations expertise everywhere we can. We question the status quo and are not afraid to suggest new ways to do things. Individual contributors are encouraged to speak up, propose new insights and take an active role in the definition of our roadmap.
Position Overview
This role will be based remotely in the US. Our team builds and operates the observability platform for Vantage Data Centers, enabling engineering and operations teams to understand system health, performance, and availability across data center and hybrid environments. To support our growth, we are looking for an experienced Observability Engineer with deep hands-on expertise in Elastic/Elasticsearch, Logstash, and Kibana, and a strong background in creating and operationalizing metrics. In this role, you will design, implement, and maintain end-to-end observability for logs and metrics: building resilient ingestion pipelines, defining schemas and parsing standards, creating Kibana dashboards and alerting, and partnering with platform, network, and application teams to set SLIs/SLOs and improve operational outcomes.
You will continuously improve performance, reliability, retention, and cost of our telemetry pipelines while applying automation and infrastructure-as-code practices to keep the platform consistent and auditable.
Job Type
Full-time
Career Level
Mid Level
Number of Employees
501-1,000 employees