Senior Infrastructure Engineer II

DigitalOceanSeattle, WA
12dRemote

About The Position

Dive in and do the best work of your career at DigitalOcean. Journey alongside a strong community of top talent who are relentless in their drive to build the simplest scalable cloud. If you have a growth mindset, naturally like to think big and bold, and are energized by the fast-paced environment of a true industry disruptor, you’ll find your place here. We value winning together—while learning, having fun, and making a profound difference for the dreamers and builders in the world. We want people who are passionate about designing and operating secure systems at scale We are looking for an experienced , motivated, adaptable, empathetic automation-focused infrastructure engineer who is comfortable working remotely. You will report to the Engineering Manager of the Foresight team, with a primary mission of “Help deliver GPU systems rapidly”. You will architect, build, support, and scale the team’s Provisioning Automation system. This system will be used to quickly and reliably provision hardware at DigitalOcean. This position will involve a high degree of leadership, ownership and autonomy - we will need to release this system quickly, and be prepared to scale it 10x. This is a fast-paced role with a lot of opportunity. In addition to our primary mission, we have many other responsibilities. For example: We develop and support several infrastructure systems built in golang We develop and maintain several fleet visualization utilities, written in golang and react We write small system utilities or daemons that run on physical hosts, and report metrics about them We expose metrics to leadership for intelligent decisionmaking , including system firmware versions, provisioning success rates, etc. We help operational teams meet deadlines by keeping them informed of project progress Our team has a big scope, but don’t let it deter you - we’re a group of kind folks. More than anything, we’re looking for someone empathetic , motivated , and driven to grow with us. Also, we’re looking to expand our team’s StackStorm expertise . If you have StackStorm experience, that’s a bonus! DigitalOcean’s Internal Culture and Tooling: DigitalOcean teams communicate primarily via Slack . Foresight makes light use of Jira and GSuite . We strive to make our work-life balance comfortable, and aim to scope high-impact work appropriately so that everyone works at a healthy pace. You can expect to be on-call periodically once you are ready, but shouldn’t expect to be paged often. DigitalOcean’s observability platform comprises VictoriaMetrics , Grafana , Alertmanager , and Elasticsearch . Knowing any of these tools is a bonus, because every service at DO is generally expected to use this platform. The Foresight team is an arm of the Hardware Lifecycle Engineering (HLE) organization. We are aimed at boosting productivity by enabling our engineers to rapidly and reliably deploy hardware in various configurations, managing the lifecycle from standup to decommission. The HLE group is made up of a diverse group of roughly 14 engineers located across the US, Canada, and Europe. The Foresight team accounts for approximately 30% of the HLE group. Within Foresight, there are growth opportunities along several tracks (i.e. Tech Leader, Subject Matter Expert (SME), Project Management, Engineering Manager, etc).

Requirements

  • Programming Languages: python, golang, shell
  • Systems: Linux, Containers, StackStorm, Ansible
  • Theory: Distributed Systems, Complex System Failure, Resilient Architecture, Quality Engineering
  • Significant experience administering Linux servers
  • Strong experience with Python, Ruby, or Golang
  • Familiarity with git
  • Familiarity with shell scripting
  • Familiarity with continuous integration systems and concepts
  • An interest in contributing work upstream
  • Excellent written and verbal English communication skills
  • Comfort executing in an asynchronous remote environment
  • Transparency, honesty, and openness to constructive feedback
  • A desire to work with a respectful and inclusive team

Nice To Haves

  • Also, we’re looking to expand our team’s StackStorm expertise . If you have StackStorm experience, that’s a bonus!
  • Familiarity with Github Actions is a plus
  • DigitalOcean’s observability platform comprises VictoriaMetrics , Grafana , Alertmanager , and Elasticsearch . Knowing any of these tools is a bonus, because every service at DO is generally expected to use this platform.

Responsibilities

  • Developing impactful, new and innovative systems that will help DigitalOcean scale
  • Responding to provisioning failures
  • Working to ensure that common provisioning failures do not recur (likely via automation)
  • Collaborating with sibling teams to deliver on wider organizational goals
  • Bringing new and actionable information to light via developing visualization tooling
  • Having fun with an amazing and welcoming team 🙂

Benefits

  • We prioritize career development. At DO, you’ll do the best work of your career. You will work with some of the smartest and most interesting people in the industry. We are a high-performance organization that will always challenge you to think big. Our organizational development team will provide you with resources to ensure you keep growing. We provide employees with reimbursement for relevant conferences, training, and education. All employees have access to LinkedIn Learning's 10,000+ courses to support their continued growth and development.
  • We care about your well-being. Regardless of your location, we will provide you with a competitive array of benefits to support you from our Employee Assistance Program to Local Employee Meetups to flexible time off policy, to name a few. While the philosophy around our benefits is the same worldwide, specific benefits may vary based on local regulations and preferences.
  • We reward our employees. The salary range for this position is based on market data, relevant years of experience, and skills. You may qualify for a bonus in addition to base salary; bonus amounts are determined based on company and individual performance. We also provide equity compensation to eligible employees, including equity grants upon hire and the option to participate in our Employee Stock Purchase Program.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

1,001-5,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service