The Senior System Engineering role is responsible for providing 24x7 Tier 1 support for agent-facing applications including Salesforce, Salesforce Marketing Cloud, Mulesoft, OPUS, and Customer Account Lookup & Management (CALM). This position involves managing escalated issues, incidents, and outages, triaging them, and driving prompt resolution. The role requires providing timely visibility and status updates on these issues to leadership, business partners, and other key stakeholders. A significant aspect of the role involves Site Reliability Engineering (SRE) tasks, such as developing application knowledge bases, creating run books, and enhancing application observability through alerts, monitoring, and dashboards for proactive incident and problem detection. The Senior System Engineer will also triage incidents, assist Tier 2 in conducting blameless post-mortems, and collaborate with Release Management to identify and mitigate risks associated with production changes. Close work with Product Development and Tier 2 SRE teams is essential for knowledge transfer regarding system changes. The role also focuses on optimizing the Tier 1 on-call process and incident response workflow, including alert rules, communication methods, and response plans. Providing metrics and status reports, establishing processes for data gathering and reporting, and staying current on feature development to ensure system reliability are key duties. Additionally, the role involves assisting in the development and maintenance of technology operations and support Standard Operating Procedures (SOPs) and T1 documentation based on industry best practices. The position requires technical leadership with strong communication skills and the ability to foster a self-motivated team, conducting rigorous due diligence on all plans.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior