Computing Products SRE Engineer

Tencent LTD | Palo Alto, CA

About The Position

  • Service Reliability: Monitor and maintain Tencent Cloud's computing-related products in the North American region to ensure stability and reliability.
  • Service Deployment & Upgrade: Use tools and platforms (CI/CD, intelligence, data) to improve the overall efficiency of the operations team, and drive product deployment, upgrades, and architecture optimization in the region.
  • Troubleshooting: Participate in troubleshooting computing products and resolve problems encountered by customers in the region.
  • Automation & Monitoring: Develop scripts, tools, and services to monitor, analyze, and enhance service quality and operations.
  • Process Improvement & Documentation: Re-engineer and document operational processes to improve service efficiency and delivery.

Requirements

  • Bachelor's degree or higher in Computer Science or a related field.
  • Ability to communicate in English and Mandarin with international teams is preferred but not required.
  • Deep understanding of Linux operating systems, including kernel, memory, processes, threads, IPC, and signals, with strong troubleshooting skills.
  • Proficiency in standard networking protocols and concepts such as HTTP, DNS, TCP/IP, ICMP, subnetting, and load balancing.
  • Strong understanding of hardware performance and how it impacts system operations.
  • Proficiency in at least one scripting language, such as Python or Shell, with a proven track record in automation.
  • Ability to take ownership of issues, troubleshoot independently, and find creative solutions or escalate when needed.

Nice To Haves

  • Extensive experience in kernel and KVM fault debugging, and proficiency in virtualization technologies such as KVM/Docker.
  • Contributions to open-source projects are a plus.
  • Preference for candidates with a Tencent Cloud Qualification Certificate or equivalent qualifications.

Benefits

  • Employees hired for this position may be eligible for a sign-on payment, relocation package, and restricted stock units, each evaluated on a case-by-case basis.
  • Subject to the terms and conditions of the plans in effect, hired applicants are also eligible for medical, dental, vision, life and disability benefits, and participation in the Company's 401(k) plan.
  • Employees are also eligible for 15 to 25 days of vacation per year (depending on tenure), up to 13 holidays throughout the calendar year, and up to 10 days of paid sick leave per year.
  • Your benefits may be adjusted to reflect your location, employment status, duration of employment with the company, and position level.
  • Benefits may also be pro-rated for those who start working during the calendar year.

What This Job Offers

  • Job Type: Full-time
  • Career Level: Mid Level
  • Industry: Broadcasting and Content Providers
  • Number of Employees: 5,001-10,000
