About The Position

Leidos was awarded the U.S. Air Force Cloud One Architecture and Common Shared Services contract and currently has an opening for a DevOps/SRE Manager supporting AWS, Azure, Google, and Oracle clouds. This is an exciting opportunity to use your experience to modernize a leading, global-scale multi-cloud environment in support of a critical mission, supporting USAF system resiliency, security, and cost effectiveness. Location: These positions will be hybrid remote. Candidates will be required to work onsite as needed. Preferred candidates will be located near Hanscom AFB (Boston, MA) or work in Huntsville, AL. Primary Responsibilities Could Include: As the DevOps/SRE manager, you will develop in a scalable cloud-native solutions, and ensure best practices across architecture, development, deployment, and security and you will lead a group of DevOps/SRE engineers. This role is essential to ensuring secure, scalable, and resilient connectivity across hybrid and multi-cloud environments. You’ll work closely with cloud engineers, cybersecurity analysts, and program leadership to drive continuous improvement and deliver value to the mission.

Requirements

  • Bachelors and five (5) years or more of experience; Masters and three (3) years or more of experience. Additional experience may be accepted in lieu of degree.
  • 2+ years of prior experience managing/leading teams/projects
  • Interim Secret clearance required to start; Ability to obtain Secret clearance required to maintain employment
  • US citizenship required
  • Certifications: CompTIA Security+ or equivalent (IAT-2)
  • Excellent facilitation, communication, and stakeholder engagement skills
  • Ability to work in a fast-paced, mission-driven environment
  • Strong documentation, communication, and cross-functional collaboration skills
  • Familiarity with DevSecOps principles and practices.
  • Familiarity with Agile methodologies such as Scrum and/or Kanban.
  • Excellent customer service skills, with experience working in a customer-facing position
  • Strong knowledge of security principles, including threat modeling, vulnerability assessments, and encryption techniques.
  • Deep understanding of CI/CD tools (e.g., Jenkins, GitLab CI, GitHub Actions, Azure DevOps, Argo CD).

Nice To Haves

  • Experience with USAF Cloud One or Platform 1
  • Experience with Zero Trust Architecture
  • Cloud certifications in AWS, Azure, Google, or Oracle clouds
  • Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, Splunk, ELK Stack).
  • Solid understanding of networking, Linux/Unix systems, and version control systems (e.g., Git)
  • Proficiency in programming/scripting languages (e.g., Python, Java, Bash, Go).
  • Experience with configuration management and orchestration tools (e.g., Terraform, Ansible, Puppet).
  • Hands-on experience with containerization and orchestration (e.g., Docker, Kubernetes).
  • Example certifications include: Industry Professional certification Certified Kubernetes Application Developer (CKAD), Kubernetes and Cloud Native Associate (KCNA), AWS Certified DevOps Engineer, Certified AWS SysAdmin, AWS Certified Advanced Networking, AWS Certified Security, Azure Developer Associate, Azure Solutions Architect

Responsibilities

  • Lead a group of 5-15 DevOps and SRE engineers to fulfill the requirements for the program
  • Provide the leadership using software engineering principles to build and maintain scalable, highly reliable, and performant large-scale systems
  • Design, implement, and maintain CI/CD pipelines for secure, automated software delivery.
  • Design and implement highly available, fault-tolerant systems for Amazon Web Services, Microsoft Azure, Google Cloud Platform, Oracle Cloud Infrastructure
  • Define and monitor SLIs, SLOs, and SLAs to ensure service reliability and performance
  • Implement robust monitoring, logging, and alerting using tools such as Prometheus, Grafana, Azure Monitor, and CloudWatch
  • Lead incident response and postmortem processes to drive continuous improvement
  • Collaborate with development teams to embed reliability into application design and deployment
  • Lead capacity planning including forecasting resource needs to ensure systems can scale effectively.
  • Ensure compliance with security best practices, including IAM, VPC design, and encryption standards
  • Implement DevSecOps pipelines for a variety of technical stacks on Amazon Web Services, Microsoft Azure, Google Cloud Platform, Oracle Cloud Infrastructure.
  • Develop infrastructure as code (IaC) using tools such as Terraform, Ansible, or CloudFormation.
  • Deploy and manage applications on cloud platforms such as AWS, Azure, Google Cloud or Oracle Cloud Infrastructure (OCI).
  • Configure and optimize container orchestration platforms (e.g., Kubernetes, Docker).
  • Maintain high availability, scalability, and performance of cloud-based systems.
  • Configure and maintain virtualized environments to ensure performance, scalability, and security
  • Support infrastructure modernization efforts by integrating virtualization solutions into hybrid cloud environments.
  • Implement automated security tools for vulnerability scanning, static/dynamic application security testing (SAST/DAST), and container security.
  • Drive consistency for deployment and build processes
  • Establish proactive monitoring solutions to ensure system reliability and availability.
  • Respond to and troubleshoot production incidents, performing root cause analysis and resolution.
  • Embed security best practices into the SDLC and CI/CD processes.
  • Develops strategy and integration methodology around design, development and implementation of cloud based solutions
  • Enable the quick development and release of changes and bug-fixes on an as-required basis and incorporate feedback from developers/users.
  • Prepare detailed technical documentation to support development and operational processes
  • Partner with business stakeholders to understand requirements and translate them into technical solutions
  • Present architectural designs and recommendations to executive leadership
  • Mentor, guide and supervise teams for related activities
  • Lead reviews and provide guidance on complex technical decisions
  • Prepare detailed technical documentation to support development and operational processes
  • Collaborate with team members and provide mentorship to junior staff, fostering a learning environment
  • Act as the DevOps/SRE manager to assess employee performance, hire new employees, and ensure compliance with corporate training requirements

Benefits

  • Pay and benefits are fundamental to any career decision. That's why we craft compensation packages that reflect the importance of the work we do for our customers.
  • Employment benefits include competitive compensation, Health and Wellness programs, Income Protection, Paid Leave and Retirement.
  • More details are available here.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service