At Early Warning, we’ve powered and protected the U.S. financial system for over thirty years with cutting-edge solutions like Zelle®, Paze℠, and so much more. As a trusted name in payments, we partner with thousands of institutions to increase access to financial services and protect transactions for hundreds of millions of consumers and small businesses.

Positions located in Scottsdale, San Francisco, Chicago, or New York follow a hybrid work model to allow for a more collaborative working environment.

Candidates responding to this posting must independently possess the eligibility to work in the United States, for any employer, at the date of hire. This position is ineligible for employment visa sponsorship.

Overall Purpose
The Principal Site Reliability Engineer partners with development teams by designing availability and resiliency patterns in applications and infrastructure.

Essential Functions
Design and implement software and tools to improve performance (availability, scalability, and latency) while delivering end products to customers with the highest efficiency and meeting all security standards.
Support the company’s commitment to risk management and protecting the integrity and confidentiality of systems and data.
Build automation and tooling around application management, such as deployments, configuration changes, and disaster recovery scenarios.
Design, implement, and evangelize observability and monitoring systems to proactively detect problems and identify their causes.
Evaluate the capacity of the application on a continuous basis to provide stats to the Product/Business teams and recommend an efficient path to scale for future needs.
Identify performance bottlenecks and work with cross-functional teams to troubleshoot and resolve issues.
Serve as a technical liaison for the application and provide documents and runbooks to Level 1 and Level 2 teams.
Participate in a 24x7 on-call rotation.
Be a champion of excellent processes; take the initiative in developing repeatable patterns and standard, reusable work across teams.
Work directly with application development teams to provide feedback and technical requirements to the software development lifecycle, implementing best-practice microservice design patterns and other modern software development approaches.
Understand and support the adoption of best-practice microservice design patterns and other modern software reliability approaches and techniques.
Be a thought leader: a senior point of expertise on site reliability engineering issues, industry trends, and developing technologies.
Be a role model to others on the team.
Coach and mentor team members.
Job Type
Full-time
Career Level
Mid Level
Number of Employees
251-500 employees