Senior Lead Reliability Engineer

LSEG, Creve Coeur, MO

About The Position

ABOUT US: LSEG (London Stock Exchange Group) is more than a diversified global financial markets infrastructure and data business. We are open-access partners dedicated to excellence in delivering the services our customers expect from us. With extensive experience, deep knowledge and worldwide presence across financial markets, we enable businesses and economies around the world to fund innovation, manage risk and create jobs. It’s how we’ve contributed to supporting the financial stability and growth of communities and economies globally for more than 300 years. Through a comprehensive suite of trusted financial market infrastructure services – and our open-access model – we provide the flexibility, stability and trust that enable our customers to pursue their ambitions with confidence and clarity. LSEG is headquartered in the United Kingdom, with significant operations in 70 countries across EMEA, North America, Latin America and Asia Pacific. We employ 25,000 people globally, more than half located in Asia Pacific. LSEG’s ticker symbol is LSEG.

OUR PEOPLE: People are at the heart of what we do and drive the success of our business. Our culture of connecting, creating opportunity and delivering excellence shapes how we think, how we do things and how we help our people fulfil their potential. We embrace diversity and actively seek to attract individuals with unique backgrounds and perspectives. We break down barriers and encourage teamwork, enabling innovation and rapid development of solutions that make a difference. Our workplace generates an enriching and rewarding experience for our people and customers alike. Our vision is to build an inclusive culture in which everyone feels encouraged to fulfil their potential.

We know that real personal growth cannot be achieved by simply climbing a career ladder – which is why we encourage and enable a wealth of avenues and interesting opportunities for everyone to broaden and deepen their skills and expertise. As a global organisation spanning 70 countries and one rooted in a culture of growth, opportunity, diversity and innovation, LSEG is a place where everyone can grow, develop and fulfil their potential with meaningful careers.

ROLE SUMMARY: We are evolving our Reliability Engineering team to move beyond support and operations. As a Senior Engineer in Site Reliability, you will be part of a diverse and inclusive organization that has full ownership of the availability, performance, and scalability of one of the most critical shared services at LSEG. We are looking for people who have a passion for learning and who bring a continuous improvement mentality to our team. SREs maintain Service Level Objectives for the systems they own. Constantly measuring and improving availability, latency, and overall system health is at the core of our team's purpose. You will write automation to scale systems sustainably and to prevent service issues or, when they occur, to recover service quickly. You will also partner with development teams to improve system reliability, observability, and release velocity. You will participate in on-call rotations, incident response, post-mortems, and root cause analysis and resolution, and be a vocal advocate of strong, sound engineering practices that allow us to build, deploy, and run scalable, reliable, and performant services. You will be part of a continuous learning and development culture.

Requirements

  • A Bachelor's degree in computer science, a related technical field involving software/systems engineering, or equivalent practical experience.
  • Object-oriented programming languages such as Java, C#, Python, or Go.
  • Unix/Linux and Windows operating systems.
  • Hands-on experience with one of the following cloud platforms: Azure, AWS, or GCP.
  • DevOps concepts and ways of working.
  • Experience with algorithms and data structures.
  • Observability practices with logging, metrics, tracing, and alerting.
  • Infrastructure as Code.
  • Understanding of identity and access management, and application security.
  • We use Datadog and BigPanda for our observability stack, Terraform for our cloud infrastructure, and Entra ID as our IAM solution, but we're very open to incorporating your experience with any other tools.

Responsibilities

  • Leading project priorities, deadlines, and outcomes.
  • Utilising deep knowledge of site reliability, software engineering, programming languages, tooling, frameworks, infrastructure and systems for each task.
  • Leading designs of software components, systems, and features to improve the availability, scalability, latency, and efficiency of LSEG’s services.
  • Leading sustainable incident response and production improvements for LSEG.
  • Providing mentorship and advice to other team members on leading availability and performance of critical services, building automation to prevent problem recurrence, and building automated responses for non-exceptional service conditions.
  • Mentoring and training other team members on design techniques and coding standards, and cultivating innovation and collaboration.
  • Writing and reviewing highly optimised and accurate code for LSEG products and solutions, and providing feedback and suggested improvements to team members.
  • Partnering with architects to decompose a solution for technology systems and products.
  • Proactively building and applying relevant domain knowledge that may relate to workflows, data pipelines, business policies, configurations and constraints.
  • Supporting essential processes while ensuring high quality standards are met.