Pennymac • Posted 3 months ago
$75,000 - $130,000/Yr
Full-time • Mid Level
Westlake Village, CA

As a member of the Site Reliability Operations (SRO) team, you will help provide 24/7 monitoring and support of Pennymac's database infrastructure and related systems. This role focuses specifically on database operations, performance optimization, and ensuring the reliability of our database platforms. Ideal candidates have strong experience in database administration, SQL development, and AWS database services, and are comfortable working in a fast-paced environment. The ability to multitask and to communicate quickly and accurately is critical to success in this role.

  • Database Monitoring – 24/7 health monitoring of Pennymac's database infrastructure using tools such as AWS CloudWatch, New Relic, and database-specific monitoring solutions.
  • Database Incident Management – Coordinate with Pennymac's Incident Management team and key stakeholders to help resolve database-related production incidents quickly and accurately.
  • Database Administration – Perform database administrative tasks, including user management, security configuration, backup validation, and query optimization.
  • Database Job Management – Monitor and troubleshoot database jobs, batch processes, and scheduled maintenance tasks. Ensure job execution meets SLA requirements and resolve failures promptly.
  • SQL Development and Optimization – Write, review, and optimize SQL queries for performance. Analyze query execution plans and recommend improvements to application teams.
  • Database Performance Tuning – Investigate and troubleshoot database performance issues, including slow queries, blocking, deadlocks, and resource contention. Implement performance improvements and capacity planning recommendations.
  • Internal and External Escalation – Act as a point of escalation for database-related production incidents. Contact key stakeholders promptly so that incidents are resolved quickly and accurately.
  • Change Management – Follow and enforce Pennymac's established Change Management processes and procedures for all database-related changes.
  • Communication – Monitor and respond to call, chat, and email inquiries about database operations sent to the SRO team.
  • Ticket Queue Management – Manage database-related requests using tools such as ServiceNow and JIRA to ensure deliverables are on time and accurate.
  • Documentation – Assist in maintaining the team's knowledge base of database support articles, runbooks, and Standard Operating Procedures. Play an active role in the creation of new database documentation as needed.
  • Database Backup and Recovery – Manage database backups, validate them, and test recovery using tools such as CommVault. Ensure backup integrity and recovery readiness.
  • Capacity Planning – Monitor database growth trends, analyze resource utilization, and provide recommendations for capacity planning and infrastructure scaling.
  • Database Security and Compliance – Maintain database security standards and access controls, and ensure compliance with regulatory requirements and company policies.
  • Bachelor’s Degree in Computer Science or comparable experience.
  • Advanced AWS Certifications strongly preferred.
  • 3–5+ years of experience working in both Windows and Linux environments, with demonstrated success in advanced troubleshooting and administration.
  • Proven proficiency in monitoring and alerting tools such as Nagios, New Relic, SumoLogic, AWS CloudWatch, and related technologies.
  • Strong scripting or programming skills in PowerShell, Python, or a similar language; ability to automate repetitive tasks and streamline operations.
  • Excellent organizational skills, with the ability to manage competing priorities and urgent issues in a fast-paced setting.
  • Strong written and verbal communication skills; able to explain complex technical issues to stakeholders at various technical levels.
  • Comfortable completing annual role-based training and certification assignments; dedicated to continual learning and development.
  • Demonstrated ability to work independently on complex tasks and to collaborate effectively with cross-functional teams.
  • Comprehensive Medical, Dental, and Vision
  • Paid Time Off Programs including vacation, holidays, illness, and parental leave
  • Wellness Programs, Employee Recognition Programs, and onsite gyms and cafe-style dining (select locations)
  • Retirement benefits, life insurance, 401(k) match, and tuition reimbursement
  • Philanthropy Programs including matching gifts, volunteer grants, charitable grants, and corporate sponsorships