Cloud Reliability Engineer

PeratonChantilly, VA
10dOnsite

About The Position

Peraton is seeking a Cloud Reliability Systems Engineer in Chantilly, VA to support our Department of Defense customer as part of a highly talented, highly motivated and high-performing team. As part of the Infrastructure Operations and Maintenance Support team you will be responsible for the availability, performance, monitoring, and incident response, among other things, of the Cloud Infrastructure that we support in a 24x7 environment. What you'll do: Ensure the 24x7 uptime of our multi-service, multi-layer, multi-tenant cloud infrastructure This position is hands-on, requiring the ability to provide first-level system and network support problem investigation, resolution, or escalate as needed Work closely with the engineering teams to improve our platforms and eliminate complexity from architecture and processes Utilize a Jira Service Desk ticketing queue for tracking system and tenant issues through resolution Configure and use state-of-the-art monitoring tools to assist with troubleshooting and remediation of issues Responsible for the monitoring the daily software and network operations in a distributed multi-tenant, multi-layer, cloud environment Conduct incident response and in-depth root cause analysis Understand and troubleshoot complex network data flow issues for Palo Alto firewalls and Arista switches Document new procedures/update existing procedures utilizing Confluence and follow SOPs for conflict resolution This job will include shift work to allow for complete 24x7 monitoring of software systems. Will need to have flexibility to work multiple shifts (day, mid, swing), as needed. Job is on-site at Peraton Chantilly, VA facility. No remote work allowed

Requirements

  • Minimum of Top Secret clearance with SCI eligibility. Contract requires TS/SCI. The candidate must maintain the clearance
  • Associates degree in Engineering, Computer Technology or related field and 7+ years of experience; OR Bachelor’s degree in Engineering, Computer Technology or related field and 5+ years of experience; OR Master’s degree in Engineering, Computer Technology or related field and 3+ years of experience. An additional four (4) equivalent experience in lieu of a degree will be considered
  • 3+ years of experience working with Linux operating systems (RHEL 8.X or higher)
  • 3+ years of experience working within a cloud environment such as RedHat Openstack, RedHat OpenShift Container Platform (RHOCP), MS Azure, or AWS
  • 2+ years of experience with containerization and automation technologies (e.g. Docker Containers, Kubernetes, Ansible, and Heat templates)
  • Demonstrated experience in monitoring tools (e.g. Splunk, Sensu, Nessus, etc.)
  • Experience supporting software and/or network operations with a clear understanding of networking fundamentals with ability to deep-dive and troubleshoot issues
  • DoD 8140 compliance for work role: Technical Support Specialist, with a proficiency level of Intermediate or above
  • Must be willing to work in a 24x7 environment and support shift work on site; 8 and/or 12 hours shifts, including weekends

Nice To Haves

  • 3+ years of experience with virtualization technologies (e.g. Citrix XenServer Red Hat Enterprise Virtualization, and/or VMWare)
  • Experience using a ticketing system (e.g. Jira Service Desk, Remedy, etc.)
  • Experience with front-end processing and network gateway appliances and /or software
  • Networking experience to include monitoring, configuring (e.g. firewalls, switches, etc.)
  • Hands-on experience with virtual machine environment management tools (Vagrant)
  • Ability to work independently or within a team structure
  • Experience with configuration management repositories
  • Experience with cross domain guard technologies (e.g. Forcepoint, Radiant Mercury, etc.)
  • Knowledge and understanding of KVM Virtualization technologies
  • Previous experience with Intelligence or DoD programs, either within the military or as a civilian contractor
  • Excellent troubleshooting skills
  • Ability to effectively communicates both with customers and technical staff

Responsibilities

  • Ensure the 24x7 uptime of our multi-service, multi-layer, multi-tenant cloud infrastructure
  • Provide first-level system and network support problem investigation, resolution, or escalate as needed
  • Work closely with the engineering teams to improve our platforms and eliminate complexity from architecture and processes
  • Utilize a Jira Service Desk ticketing queue for tracking system and tenant issues through resolution
  • Configure and use state-of-the-art monitoring tools to assist with troubleshooting and remediation of issues
  • Responsible for the monitoring the daily software and network operations in a distributed multi-tenant, multi-layer, cloud environment
  • Conduct incident response and in-depth root cause analysis
  • Understand and troubleshoot complex network data flow issues for Palo Alto firewalls and Arista switches
  • Document new procedures/update existing procedures utilizing Confluence and follow SOPs for conflict resolution

Benefits

  • Peraton offers enhanced benefits to employees working on this critical National Security program, which include heavily subsidized employee benefits coverage for you and your dependents, 25 days of PTO accrued annually up to a generous PTO cap and eligible to participate in an attractive bonus plan

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

Associate degree

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service