We are looking for highly skilled Java Site Reliability Engineers (SREs) with strong Java application knowledge and hands-on experience in production support. Engineers on the team should be able to make code-level changes for minor bug fixes as needed, as well as handle code deployment and automation using Python. They should also be able to create alerts and dashboards that monitor the health of production applications, set up alerts based on application logs (e.g., in Splunk) in line with defined SLAs, create alerts for application-level process health checks, and develop dashboards for monitoring infrastructure and performance, including database query performance and API call performance.
Career Level: Mid Level
Industry: Professional, Scientific, and Technical Services
Number of Employees: 251-500