Peraton is seeking a Cloud Reliability Systems Engineer in Chantilly, VA to support our Department of Defense customer as part of a highly talented, highly motivated and high-performing team. As part of the Infrastructure Operations and Maintenance Support team you will be responsible for the availability, performance, monitoring, and incident response, among other things, of the Cloud Infrastructure that we support in a 24x7 environment. What you'll do: Ensure the 24x7 uptime of our multi-service, multi-layer, multi-tenant cloud infrastructure This position is hands-on, requiring the ability to provide first-level system and network support problem investigation, resolution, or escalate as needed Work closely with the engineering teams to improve our platforms and eliminate complexity from architecture and processes Utilize a Jira Service Desk ticketing queue for tracking system and tenant issues through resolution Configure and use state-of-the-art monitoring tools to assist with troubleshooting and remediation of issues Responsible for the monitoring the daily software and network operations in a distributed multi-tenant, multi-layer, cloud environment Conduct incident response and in-depth root cause analysis Understand and troubleshoot complex network data flow issues for Palo Alto firewalls and Arista switches Document new procedures/update existing procedures utilizing Confluence and follow SOPs for conflict resolution This job will include shift work to allow for complete 24x7 monitoring of software systems. Will need to have flexibility to work multiple shifts (day, mid, swing), as needed. Job is on-site at Peraton Chantilly, VA facility. No remote work allowed
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
Associate degree
Number of Employees
5,001-10,000 employees