We are seeking a Senior Distributed Storage SRE Engineer to handle the daily operation and maintenance of distributed storage systems, including production releases, software deployment, monitoring, and routine inspection. The role focuses on keeping block storage stable, designing and implementing disaster recovery solutions, and improving service reliability, scalability, and performance so that the system meets its SLA. You will also plan and manage resources for block storage and related systems to improve efficiency, and help build the operations support platform by developing tools that streamline operational work. A key part of this role is responding quickly to production incidents: detecting, debugging, and resolving common faults, latent risks, and performance problems, and carrying out emergency plans and fault recovery strategies.
Job Type
Full-time
Career Level
Senior