Senior Devops Engineer

OVHcloudIrving, TX
1d

About The Position

The Senior DevOps Engineer is responsible for ensuring high availability, performance, monitoring, and incident response for OVHcloud Baremetal products and services. This position drives the reliability, development, configuration, and deployment of our current and future products and services. This includes investigating and debugging errors and submitting fixes that contribute to software development to improve our services. Base pay range: $120,000 - $136,500 (based on relevant experience).

Requirements

  • 5+ years of relevant experience is required, including DevOps, programming, and administration of Linux/Unix/Windows operating systems.
  • Experience performing day-to-day operational (DevOps) tasks and working with microservices and multiple APIs.
  • Experience with languages such as Python, Perl, Go, Bash, etc.
  • Experience with maintenance/configuration of monitoring, metrics, and logging infrastructures like Nagios, Grafana, Graylog.
  • Experience with virtualization and container technology.
  • Experience leading and working Major Incident responses and resolutions.
  • Experience leading the debugging effort and troubleshooting issues with deployed code.
  • Ability to efficiently prioritize, organize, and complete tasks throughout the workday and adjust when new priorities arise.
  • Bachelor’s degree in computer science or a related field, or equivalent and relevant experience preferred.

Nice To Haves

  • Experience with open-source configuration management tools such as Puppet, Ansible, etc. preferred.
  • Experience managing a distributed, highly available, high-traffic infrastructure based on Linux is preferred.
  • Well-versed in cloud technologies and terminology.

Responsibilities

  • Maintain essential OVHcloud infrastructures, products, and services.
  • User Acceptance Testing (UAT) for new product launches.
  • Author knowledge base articles and instructional guides as needed.
  • Automate tasks by developing scripts and tooling.
  • Participate in building, deploying, and/or troubleshooting of microservices software applications and other underlying APIs.
  • Responsible for monitoring the alerting systems and submitting configuration changes on a regular basis to ensure availability of systems and services.
  • Install, deploy, and configure OVHcloud infrastructure as new capabilities are developed.
  • Analyze data and develop meaningful automated reports to be used by technical and business leaders.
  • Diagnose errors with a data-driven approach, analysing and categorizing of data and logs, to resolve issues effectively and efficiently.
  • Lead the troubleshooting and debugging efforts for Major Incidents.
  • Write well-documented root cause analyses with recommended official documentation to prevent future critical issues.
  • Take part in on-call rotations, including weekend coverage.
  • Stay up to date with industry trends and emerging technologies.
  • Be a mentor and leader to other engineers on the team.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service