CDN Site Reliability Engineer L4 - Open Connect

NetflixLos Gatos, CA
81d$170,000 - $720,000

About The Position

Netflix is one of the world's leading entertainment services, with over 300 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time. How do you spark joy in hundreds of millions of people? It starts with a vision - that technology can give voice to stories around the world. In delivering those much-loved stories, Netflix is responsible for a significant portion of global internet traffic. To steward that responsibility, we work collaboratively with ISPs to deploy Open Connect, Netflix's Content Delivery Network (CDN), our in-house custom-built network and server infrastructure responsible for delivering 100% of Netflix's video traffic. We strive to deliver a great Netflix viewing experience in over 190 countries so our customers can watch whatever, whenever, interruption free. We are seeking a Reliability Engineer with experience in *nix, networking, data analysis, and large-scale platform operations experience to design, scale, operate, automate, and analyze our globally distributed CDN. Come join us and play a meaningful role in our journey to entertain the world!

Requirements

  • 3+ years of Service Reliability/Operational experience running large-scale, high-performance systems & internet services with a focus on performance and reliability.
  • Strong working knowledge of networking concepts and application protocols, especially TCP/IP, BGP, DNS, TLS, and HTTP/S with focused experience on CDNs and HTTP cache/proxy technologies.
  • Skilled in designing, creating and maintaining automation written in a programming language such as Python.
  • Expert-level knowledge managing and debugging Unix/Linux systems (engineering fundamentals, networking, storage, operating systems) at scale.
  • Experience with distributed analytic processing technologies (Presto/Trino, Spark SQL, etc).
  • Strong understanding of applied statistics and the ability to code systems that identify outlier behavior in large systems.
  • Some experience with container and container orchestration technologies (Docker).
  • Ability to work in a highly collaborative environment and to communicate cross functionally with internal and external partners.

Responsibilities

  • Drive continual improvement in resiliency, observability, monitoring, instrumentation, and automation with the primary goal of maintaining a highly scalable and reliable CDN platform worldwide.
  • Aggregate, analyze, and correlate large amounts of server and application performance data.
  • Use the innovative Netflix Big Data platform as a highly flexible, specialized and efficient toolset to identify opportunities for platform optimization, system reliability improvements as well as identifying patterns/anomalies for further investigation.
  • Provide technical design and engineering assistance to ISP partners to integrate our Open Connect Appliances.
  • Handle Tier 3 escalation and participate in an on-call rotation for the CDN platform production issues.

Benefits

  • Health Plans
  • Mental Health support
  • 401(k) Retirement Plan with employer match
  • Stock Option Program
  • Disability Programs
  • Health Savings and Flexible Spending Accounts
  • Family-forming benefits
  • Life and Serious Injury Benefits
  • Paid leave of absence programs
  • Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off.
  • Full-time salaried employees are immediately entitled to flexible time off.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Industry

Motion Picture and Sound Recording Industries

Education Level

Bachelor's degree

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service