Senior Observability Engineer

LPL Financial ServicesSan Diego, CA
51dHybrid

About The Position

LPL is seeking a Senior Observability Engineer to enhance system resilience and visibility across our enterprise platforms. This role will focus on designing and implementing scalable observability solutions that support rapid incident response, performance optimization, and continuous improvement. You will collaborate with engineering teams to standardize monitoring practices and drive innovation in observability tooling and strategy. We want strong collaborators who can deliver a world-class client experience. We are looking for people who thrive in a fast-paced environment, are client-focused, team-oriented, and are able to execute in a way that encourages creativity and continuous improvement. We're looking for strong collaborators who deliver exceptional client experiences and thrive in fast-paced, team-oriented environments. Our ideal candidates pursue greatness, act with integrity, and are driven to help our clients succeed. We value those who embrace creativity, continuous improvement, and contribute to a culture where we win together and create and share joy in our work.

Requirements

  • 7+ years of experience in observability, monitoring, or site reliability engineering.
  • Advanced troubleshooting and monitoring expertise using Dynatrace and related APM tools (Dynatrace certification preferred).
  • Rich experience with metrics and logging tools such as SolarWinds, ELK, Kibana.
  • Proficiency in scripting and automation using Python, Bash, or PowerShell.
  • Experience with Monitoring as Code (MaC) using Terraform, CloudFormation, or Ansible.
  • Strong knowledge of Kubernetes, Docker, and microservices architectures.
  • Familiarity with CI/CD pipelines and DevOps practices.
  • Knowledge of AIOps and predictive monitoring techniques.
  • Cross-platform experience in Windows Server, Linux/AIX, Networking, Virtualization, Database (MSSQL/Oracle), Cloud Computing (AWS/Azure), and storage platforms (IBM/EMC/INFINIDAT).
  • Experience with middleware service layers (F5, Tibco, Datapower, MuleSoft), caching technologies, database technologies (MSSQL, Oracle, MySQL, Aurora RDM, MapR), authentication (PingFederate, Forgerock), and RPA tools (Workfusion).

Responsibilities

  • Design, implement, and maintain observability solutions using AWS CloudWatch, Dynatrace, ELK, SolarWinds, and other monitoring tools.
  • Integrate OpenTelemetry for distributed tracing and improve end-to-end system observability.
  • Implement Monitoring as Code using infrastructure-as-code tools such as Terraform and CloudFormation.
  • Partner with SREs, DevOps, and Software Engineers to define and enforce observability standards.
  • Develop and standardize practices for monitoring, logging, and alerting across platforms.
  • Optimize performance monitoring, anomaly detection, and automated incident response strategies.
  • Drive observability-related incident investigations, root cause analysis, and post-mortem processes.
  • Assist in major incidents and participate in on-call rotation for tool support.
  • Continuously evaluate and introduce new observability tools and methodologies.
  • Create dashboards, alerts, and reports to provide actionable insights into system performance and availability.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Industry

Securities, Commodity Contracts, and Other Financial Investments and Related Activities

Education Level

No Education Listed

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service