Cloud and Application Infrastructure Systems Engineer

LLNLLivermore, CA
$146,340 - $222,564Hybrid

About The Position

We have an opening for a Cloud and Application Infrastructure Systems Engineer . You will architect, implement, and administer the cloud and application infrastructure required to support enterprise applications across the Laboratory’s business services and programmatic missions. This position is in the Enterprise Infrastructure Services (EIS) Division in the Computing Directorate, supporting the LivIT Program. This position offers a hybrid schedule, blending in-person and virtual presence. You will have the flexibility to work from home one or more days per week. This position will be filled at either the SES.2 or SES.3 level based on knowledge and related experience as assessed by the hiring team. Additional job responsibilities (outlined below) will be assigned if hired at the higher level. In this role you will Architect and implement highly available application infrastructures required to support enterprise IT service delivery. Leverage systems programming expertise to automate and manage service orchestration, monitoring, configuration management, security, and patch management of application infrastructures and services. Collaborate with customers, IT managers, developers, team members, systems administrators, database administrators, and security administrators to jointly develop highly available application infrastructures & services that meet business & programmatic needs. Manage systems architecture throughout their systems development lifecycles, including risk analysis, capacity planning, highly available systems architecture and design, development, quality assurance, documentation, standards enforcement, implementations and upgrades, security compliance & vulnerability remediation, and performance monitoring. Operationally support application infrastructure technologies (For Example: Java Play, Oracle WebLogic, Apache, Tomcat, IIS, load balancers, API gateways, or DockerEE/AWS EKS), and various enterprise commercial off the shelf (COTs) stacks. Provide superior problem remediation support within the application tier environments in support of negotiated Service Level Agreements (SLA’s). Support the transition of our existing virtualized application infrastructures into containerized services to help drive configuration standardization & agility, CI/CD automation & orchestration, and security enhancement objectives. Perform other duties as assigned.

Requirements

  • Ability to obtain and maintain a US DOE Q-level security clearance which requires U.S. Citizenship.
  • Bachelor's degree in Computer Science, Software Engineering, Management Information Systems, or related field, or the equivalent combination of education and related experience.
  • Broad experience in implementing and managing application and cloud service infrastructure; including monitoring, operational support, security, and lifecycle management.
  • Experience with systems programming and/or automation tools to create, configure, monitor, and automate the management of application infrastructures required to maintain the requisite availability and security posture of deployed services.
  • Broad experience collaborating with customers, IT managers, developers, database administrators, systems administrators, and security administrators to jointly create application infrastructure services that meet business requirements.
  • Experience with public and/or private cloud services, and UNIX and/or Windows O/S administration, security, and performance tuning.
  • Experience managing application infrastructure technologies, such as Java Play, Oracle WebLogic, Apache, Tomcat, IIS, load balancers, API gateways, DockerEE/AWS EKS, or various enterprise commercial off the shelf (COTs) stacks.
  • Proficient communication, facilitation, and collaboration skills necessary to effectively present, explain, and advise senior management and/or external sponsors.
  • Perform other duties as assigned.
  • Advanced experience with systems programming to create, configure, monitor, and automate the management of complex application infrastructures required to maintain the requisite availability and security posture of deployed infrastructures and associated services.
  • Significant experience architecting and managing application and cloud service infrastructures; including implementation, monitoring, operational support, security, and lifecycle management.
  • Advanced experience in enterprise observability and alerting, including designing and implementing golden-signal dashboards (latency, traffic, errors, saturation), establishing SLO/SLA-based alerting strategies with mature on-call readiness and documented runbooks, and integrating centralized log management platforms (e.g., Datadog Logs or Splunk) to correlate logs with metrics and distributed traces for rapid incident detection, triage, and resolution.
  • Advanced experience in infrastructure monitoring and automation, delivering monitoring across Kubernetes/EKS, VMs, load balancers, and web/app servers, and automating monitors, dashboards, and configuration via Ansible and APIs.

Nice To Haves

  • Master’s degree in Computer Science, Computer Engineering, or Management Information Systems.
  • Experience leveraging Splunk or equivalent SIEM environment to operationally manage application infrastructure services.
  • Experience working in a compliance-driven and secure environment, as well as project management experience with large, enterprise COTs application implementations.

Responsibilities

  • Architect and implement highly available application infrastructures required to support enterprise IT service delivery.
  • Leverage systems programming expertise to automate and manage service orchestration, monitoring, configuration management, security, and patch management of application infrastructures and services.
  • Collaborate with customers, IT managers, developers, team members, systems administrators, database administrators, and security administrators to jointly develop highly available application infrastructures & services that meet business & programmatic needs.
  • Manage systems architecture throughout their systems development lifecycles, including risk analysis, capacity planning, highly available systems architecture and design, development, quality assurance, documentation, standards enforcement, implementations and upgrades, security compliance & vulnerability remediation, and performance monitoring.
  • Operationally support application infrastructure technologies (For Example: Java Play, Oracle WebLogic, Apache, Tomcat, IIS, load balancers, API gateways, or DockerEE/AWS EKS), and various enterprise commercial off the shelf (COTs) stacks.
  • Provide superior problem remediation support within the application tier environments in support of negotiated Service Level Agreements (SLA’s).
  • Support the transition of our existing virtualized application infrastructures into containerized services to help drive configuration standardization & agility, CI/CD automation & orchestration, and security enhancement objectives.
  • Perform other duties as assigned.
  • Provide solutions to complex problems that require in-depth analysis of tangible and intangible factors and the creative use of established and/or innovative methods in architecting and designing application infrastructures and services.
  • Leverage advanced subject matter expertise in one or more application infrastructure engineering areas to support the development of advanced application architectures.
  • Leverage advanced subject matter expertise in systems programming, automation, and orchestration tooling to automate the creation, configuration, and management of complex systems.

Benefits

  • Flexible Benefits Package
  • 401(k)
  • Relocation Assistance
  • Education Reimbursement Program
  • Flexible schedules (depending on project needs)
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service