Elastic Observability Engineer - PLEX

Rockwell Automation
Mayfield Heights, OH
Hybrid

About The Position

The Elastic Observability Engineer is an important member of our Cloud Operations team, building a world-class Application Performance Monitoring solution to support observability-driven development. You will handle alert response, triage, maintenance, expansion and development to ensure monitoring systems remain healthy and reliable.

Requirements

  • Bachelor's Degree or equivalent years of relevant work experience
  • Legal authorization to work in the US is required. We will not sponsor individuals for employment visas, now or in the future, for this job opening.

Nice To Haves

  • Typically requires 2+ years of relevant experience in observability, monitoring, logging, or IT operations.
  • Linux/Unix: Hands‑on administration, patching, networking basics, LVM, and troubleshooting.
  • Windows: Basic OS administration experience.
  • Elastic Stack: Working knowledge of Elasticsearch, Kibana, Fleet, Elastic Agent, APM, Logstash, architecture basics, and core troubleshooting.
  • Observability: Understanding of monitoring practices, observability concepts, dashboards, and visualization creation.
  • Scripting & Automation: Bash scripting, familiarity with Ansible, and experience with Python or PowerShell.
  • Soft Skills: Problem‑solving, ability to work in a fast‑paced environment, and a self‑starter mindset.
  • Linux/Unix engineering experience; Windows engineering experience.
  • OpenTelemetry (OTEL) or APM experience.
  • Knowledge of Ansible, Terraform, RabbitMQ, Docker, Kubernetes, Azure DevOps (AZDO), and CI/CD.
  • Guest‑level experience with VMware or Azure.
  • Project management exposure.

Responsibilities

  • Operational Support & Triage: Respond to alerts, ingestion issues, and user‑reported incidents.
  • Perform initial diagnosis, document findings, and resolve or escalate as needed.
  • Monitor and troubleshoot Elasticsearch clusters, pipelines, Elastic Agent, Logstash, and Fleet.
  • Participate in the on‑call rotation.
  • Maintenance & Routine Operations: Perform service restarts, ingestion validation, and configuration updates.
  • Apply Linux/Unix patches and perform basic OS maintenance.
  • Conduct regular cluster health, capacity, and LVM/storage checks.
  • Support dashboard upkeep and proactive monitoring.
  • Data Quality & Pipeline Support: Verify log, metric, and trace data completeness and ECS alignment.
  • Perform periodic data validation.
  • Document known issues, parsing gaps, and operational patterns.
  • Dashboards, Visualization & Search: Use and maintain Kibana dashboards for operational visibility.
  • Create dashboards and visualizations to support platform monitoring.
  • Use KQL or Lucene queries to validate data and investigate issues.
  • Collaboration: Provide actionable escalation details to SRE, DevOps, and Security teams.
  • Maintain runbooks, SOPs, and troubleshooting guides.
  • Communicate effectively during incidents and follow‑ups.

Benefits

  • Health Insurance including Medical, Dental and Vision
  • 401k
  • Paid Time Off
  • Parental and Caregiver Leave
  • Flexible Work Schedule: work with your manager to arrange a schedule that fits your personal life.