As part of the Secure the Enterprise initiative, develop capabilities to shift from the current manual system security evaluation and authorization process to a new model that emphasizes automation, streamlined processes and approvals, continuous monitoring and assessment, and network data gathering across the entire life cycle of a project. This is a blended position across all teams in ANDS. As such, the successful candidate will have skills crossing from Software Development, Systems Engineering, Systems Administration, and Operations Support. Ensure data reliability and accuracy by monitoring and maintaining application systems. Contribute to operational success through continuous performance monitoring and proactive troubleshooting, to potentially include changes to system configuration, changes to the code base, etc. Maintain system health by tracking metrics, logs, dashboards, alerts, and application status to prevent downtime and ensure optimal application performance. Support watchfloor operations by identifying service degradation, failed dataflows, system errors, and application issues, and escalating as needed. Perform basic Linux system administration tasks, including checking service status, starting, stopping, and restarting services, reviewing logs, validating disk, memory, and CPU usage, and supporting server reboots. Support cloud-based operations by using the AWS Console or AWS CLI to check instance health, review system status, restart or reboot servers, and assist with basic operational troubleshooting. Work in a rotating shift schedule, 6AM-6PM / 6PM-6AM, to provide 24/7 application support on our watch floor.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
High school or GED