This role involves leading the response to production issues, ensuring minimal downtime and adherence to SLAs. The Senior System Engineer will build alerting, monitoring, and dashboards for proactive problem identification. They will use strong analytical and technical skills to diagnose and resolve complex production issues, focusing on immediate impact mitigation and automating recovery processes. The role also includes working with development teams on long-term solutions, creating and maintaining system documentation, and developing scripts and automation tools. A key aspect is identifying and ensuring non-functional requirements like reliability, performance, and scalability are met before production deployment. The engineer will monitor application performance using tools like Dynatrace and App Dynamics, identify bottlenecks, and optimize application performance. Defining SLI/SLOs and Error Budgets, and working with teams to document failure patterns and implement remediations for application resilience are also responsibilities. Capacity planning, participating in security assessments, responding to security incidents, and collaborating with Release Management on production changes are expected. The role requires supporting application releases and deployments, ensuring controlled rollouts with minimal impact. Proactive problem detection, trend analysis, and providing metrics and status reports to leadership are crucial. Strong communication skills are essential, as is knowledge transfer with Product Development teams. The position requires 24x7 on-call support for various applications, including J2EE apps, Salesforce, Salesforce Marketing Cloud, and MuleSoft, using an SRE approach. Experience with Java EE apps, ERP, CRM apps, web application architecture and development, and various observability tools is necessary. Proficiency in integration technologies, API Gateways, MuleSoft, WebLogic, Object-Oriented Programming languages (Java, J2EE, JavaScript, Spring), automation tools (Python, Shell), containerization (Docker, Kubernetes), cloud services (Azure), DevOps practices (CI/CD, Git, Jenkins), network protocols, load balancing, security principles, SQL queries, and Linux shell scripting is required.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior