Engineer II, Site Reliability Operations

PennymacWestlake Village, CA
14d$68,000 - $115,000Onsite

About The Position

Pennymac is (NYSE: PFSI) is a specialty financial services firm with a comprehensive mortgage platform and integrated business focused on the production and servicing of U.S. mortgage loans and the management of investments related to the U.S. mortgage market. At Pennymac, our people are the foundation of our success and at the heart of our dynamic work culture. Together, we work towards a unified goal of helping millions of Americans achieve aspirations of homeownership through the complete mortgage journey. A Typical Day As the Site Reliability Operations, Engineer II (SRO), you will help the team provide 24/7 monitoring and support of the company's IT Infrastructure. Ideal candidates should have experience in Windows and Linux administration, in addition to experience working in AWS, as Pennymac is now almost completely migrated into the AWS cloud. Individuals in this role should be comfortable working in a fast-paced environment. Multitasking, in addition to communicating quickly and accurately, is critical to the success of anyone in this role.

Requirements

  • Bachelor's Degree in Computer Science or comparable experience
  • AWS Solutions Architect and/or AWS SysOps Administrator certification
  • Proficient with Windows and Linux administration
  • Proficient with Monitoring and Alerting tools such as Nagios, New Relic, SumoLogic, and AWS CloudWatch
  • Proficient with programming languages such as Powershell or Python
  • Strong attention to detail
  • Able to prioritize tasks and have a sense of urgency with critical issues or requests
  • Excellent written and verbal communication skills
  • Must be comfortable completing annual role-based training and certification assignments

Responsibilities

  • Monitoring - 24/7 health monitoring of Pennymac's IT Infrastructure using tools such as AWS CloudWatch and New Relic.
  • Alert Management - participate in the active modification and creation of alerts to ensure the SRO team has constant visibility and is able to proactively identify threats to the stability of Pennymac's IT Infrastructure.
  • Incident Management - Engineers will coordinate with Pennymac's Incident Management team, Application Developers, Internal Support Teams, and 3rd Party Vendors, with the goal of resolving any production service outages quickly and accurately.
  • Systems Administration - responsible for various administrative tasks in both a Windows or Linux environment.
  • Virtual Server and Desktop Management - maintenance and troubleshooting of Pennymac's virtual server and desktop environments.
  • Technical Troubleshooting and Investigation - investigate and troubleshoot various technical issues that are submitted by Pennymac's IT and Application Development teams.
  • Internal and External Escalation - act as a point of escalation for any production impacting incidents. Ensure both internal and external support teams are contacted in a timely manner to ensure a quick and accurate resolution.
  • Change Management - follow and enforce Pennymac's established Change Management processes and procedures.
  • Communication - monitor and respond to Call, Chat, and Email inquiries sent to the SRO team.
  • Ticket Queue Management - responsible for managing multiple different Ticket Queues using tools such as ServiceNow and JIRA to ensure deliverables are on time and accurate.
  • Documentation - assist in maintaining the SRO team's knowledge base of support articles and Standard Operating Procedures. Play an active role in the creation of new documentation as needed.
  • Deployments - handle application and website code deployments, making use of tools such as Jenkins and GitLab.
  • Data backup, recovery, retention, and compliance - responsible for various tasks related to backup management using tools like CommVault and AWS Backup.
  • Project Management - organize and prioritize tasks, adhere to deadlines, and achieve all project goals within the given constraints.

Benefits

  • Comprehensive Medical, Dental, and Vision
  • Paid Time Off Programs including vacation, holidays, illness, and parental leave
  • Wellness Programs, Employee Recognition Programs, and onsite gyms and cafe style dining (select locations)
  • Retirement benefits, life insurance, 401k match, and tuition reimbursement
  • Philanthropy Programs including matching gifts, volunteer grants, charitable grants and corporate sponsorships
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service