Senior Linux System Admin - Federal

ServiceNow, San Diego, CA
47d

About The Position

Please Note: This position will include supporting our US Federal customers. It requires passing a ServiceNow background screening, USFedPASS (US Federal Personnel Authorization Screening Standards), which includes a credit check, a criminal/misdemeanor check, and a drug test. Any employment is contingent upon passing the screening. Due to Federal requirements, only US citizens, US naturalized citizens, or US Permanent Residents holding a green card will be considered.

The Team

As a key member of the Systems Administration team within Operations Engineering, you will be responsible for the administration and operations of the global cloud infrastructure that runs our SaaS product. This is an opportunity to be at the core of running a Cloud SaaS platform that scales to millions of users! The Cloud Operations team is responsible for ensuring the availability and efficiency of the server infrastructure that runs our SaaS platform while consuming and deploying products that have been newly developed by engineering teams. You will be working closely with engineers and developers across the company.

Requirements

  • The ideal candidate will have expert-level skills and a background in systems administration and engineering, with an understanding of the components of a cloud infrastructure, including hardware platforms, OS, applications, databases, networks, and web and application servers. Prior experience in Site Reliability Engineering/DevOps and in managing large-scale server infrastructure in a cloud computing or MSP setting is highly desirable.
  • Strong Linux expertise is a must.
  • 4+ years of experience with Linux (Red Hat and/or CentOS)
  • Experience with performance and availability monitoring, analysis, and configuration management platforms (e.g. Nagios/Icinga, Cacti, Ansible, Puppet, CFEngine, Chef, Splunk, Logstash) is desirable.
  • Working-level knowledge of one of the following: Perl, Python, JavaScript
  • Familiarity with MySQL, Oracle, MariaDB, or similar technologies; proficiency preferred
  • Expert-level skills and experience with service troubleshooting in a production environment covering web front-end, systems, databases, and networks.
  • Familiarity with networking technologies such as routing, switching, and load balancing. F5 and NGINX experience is ideal.
  • Understanding of ITIL v3 framework and how it applies to incidents, problems and change.
  • Candidate must have good communication skills and work well in a collaborative team environment.

Responsibilities

  • Contribute to Configuration Management and Infrastructure as Code for ServiceNow's global private cloud.
  • Develop tools in Python, bash, and JavaScript to replace manual work and improve the customer maintenance experience.
  • Drive enhancements and bug fixes for large-scale automation projects such as patching, provisioning, and kickstart domains.
  • Design and implement procedures for maintenance work that automation and tooling cannot handle; drive resolution of root causes with internal team members.
  • Prepare new ServiceNow products and services for production readiness with design review, feedback to engineering teams, training, and testing.
  • Use broad knowledge and experience of systems administration and networking principles to proactively prevent and address incidents while constantly improving documentation.
  • Participate in escalations and Root Cause Analysis of issues in both US Federal and global Commercial infrastructures.
  • Troubleshoot database backup and restore failures and perform database migrations.
  • Support operation of a wide variety of infrastructure services including Machine Learning and Prediction, Cloudera Big Data clusters, Kafka and RabbitMQ messaging, database encryption, email infrastructure at scale, DNS, Puppet, Elasticsearch, F5 BIG-IP, and more.

Benefits

  • Health plans, including flexible spending accounts, a 401(k) plan with company match, ESPP, matching donations, a flexible time away plan, and family leave programs (subject to eligibility requirements).

What This Job Offers

  • Job Type: Full-time
  • Career Level: Mid Level
  • Industry: Professional, Scientific, and Technical Services
  • Education Level: No Education Listed
  • Number of Employees: 5,001-10,000 employees

© 2024 Teal Labs, Inc