Principal Systems Reliability Engineer

Alaska AirlinesSeaTac, WA
71d$138,500 - $207,750

About The Position

The Principal Systems Reliability Engineer (SRE) is the sole subject matter expert in software engineering and IT operations. As an individual contributor, this role defines long-term strategy for drive continuous improvement in system resilience, contributing to a robust and efficient operational environment. The Principal Systems Reliability (SRE) role drives an enterprise SRE center of excellence team and is instrumental in helping our airline group run a safe, robust, and reliable operation.

Requirements

  • 7 years of experience in information technology or related area.
  • A Bachelor's degree, preferably with a focus in computer science, engineering, information systems, or an additional two years of training/experience in lieu of this degree.
  • Technical knowledge of application designs and architectures.
  • Minimum age of 18.
  • Must be authorized to work in the U.S.
  • High school diploma or equivalent is required.

Nice To Haves

  • Demonstrate experience in coaching and mentoring system engineers.
  • Experience applying ITIL and IT process best practices.
  • Experience with technical engineering working in IT operations.
  • Experience with SRE practices, agile methodologies, development lifecycles, and DevOps best practices.
  • Ability to work collaboratively with cross functional teams to understand objectives, gather automation requirements, write technical specifications and perform in a lead role.
  • Experience with event correlation.
  • Experience with Windows and Linux based operating environments.
  • Experience with multi-cloud environments.
  • Excellent communication skills and a proven ability to collaborate with a variety of team members.
  • Proven ability to successfully work with multiple vendors.
  • Strong interpersonal, organizational, communication, and customer service skills.

Responsibilities

  • Define long-term strategy and best practices for processes for release management and automated actions to enhance system reliability.
  • Define Application Performance Monitoring (APM) standards and assist teams with application monitoring and observability to proactively identify issues.
  • Collaborate with product teams to meet SLO's by minimizing service disruptions and maintain high availability.
  • Collaborate and network with product teams to ensure seamless code deployment and operational excellence.
  • Provide consulting support for system design, platform management, and capacity planning.
  • Collaborate and influence IT teams to improve system support, monitoring, and administration.
  • Ensure operational compliance with all security, privacy, audit, disaster recovery, and other requirements.

Benefits

  • Free stand-by travel privileges on Alaska Airlines, Hawaiian Airlines & Horizon Air.
  • Comprehensive well-being programs including medical, dental and vision benefits.
  • Generous 401k match program.
  • Quarterly and annual bonus plans.
  • Generous holiday and paid time off.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Senior

Industry

Air Transportation

Education Level

Bachelor's degree

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service