Infrastructure Engineer

UBSRaleigh, NC
74d

About The Position

Do you have a solid tech foundation and a passion for reliability engineering to grow within a global, mission-critical environment? Are you eager to learn, thrive in a collaborative Agile environment, and are excited to contribute to the reliability of financial platforms? We are looking for a motivated and curious Site Reliability Engineer (SRE) to assist in maintaining and improving the availability and reliability of systems and services, support the definition and tracking of SLIs, SLOs, and SLAs under guidance from other engineers, help identify and reduce TOIL through automation and process improvement, participate in incident response and learn from post-incident reviews, collaborate with development and infrastructure teams to integrate reliability best practices, and contribute to the integration of systems with observability platforms (e.g., Prometheus, Grafana, ELK, Datadog).

Requirements

  • Ideally, 5+ years of experience in systems engineering, DevOps, or software development roles.
  • Familiarity with operating systems (e.g. Unix, Windows), cloud platforms (e.g., Azure), and basic networking concepts.
  • Exposure to scripting or programming (e.g., Python, Bash, Go).
  • Understanding of CI/CD pipelines and containerization (e.g., Docker).
  • Awareness of infrastructure as code tools (e.g., Terraform, Ansible).
  • Interest in observability, monitoring, and incident response.
  • Willingness to learn about SLIs, SLOs, SLAs, and TOIL reduction.
  • A collaborative, proactive attitude and a desire to work in a global Agile team.

Responsibilities

  • Assist in maintaining and improving the availability and reliability of systems and services.
  • Support the definition and tracking of SLIs, SLOs, and SLAs under guidance from other engineers.
  • Help identify and reduce TOIL through automation and process improvement.
  • Participate in incident response and learn from post-incident reviews.
  • Collaborate with development and infrastructure teams to integrate reliability best practices.
  • Contribute to the integration of systems with observability platforms (e.g., Prometheus, Grafana, ELK, Datadog).

Benefits

  • Flexible working options when possible.
  • Opportunities to grow and develop skills.
  • Supportive team environment.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Industry

Securities, Commodity Contracts, and Other Financial Investments and Related Activities

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service