Senior Systems Operations Engineer – Infrastructure Development

Wells Fargo | Minneapolis, MN
3d | $42 - $81 | Hybrid

About The Position

Wells Fargo is seeking a seasoned Senior Systems Operations Engineer to join our App and Web Engineering team and build the automation foundations that provision, manage, and scale our enterprise application and web server hosting platforms. This role is ideal for a hands-on engineer who understands the operational complexity of running large-scale server environments and is passionate about enabling frictionless self-service through automation. You will lead by example, designing and implementing modular IaC components and GitOps workflows that abstract the complexity of provisioning and managing application/web servers, configuring runtime settings, tuning performance parameters, enforcing security policies, integrating with routing layers, and ensuring end-to-end observability.

Requirements

  • 4+ years of Systems Engineering or Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • 3+ years of full‑stack or backend software development experience using Java and/or Python
  • 3+ years of hands-on experience deploying and operating one or more application/web server technologies (Tomcat, Apache HTTPD, IBM Liberty, NGINX, etc.)
  • 3+ years of experience with IaC tools such as Terraform and Ansible
  • 3+ years of experience implementing GitOps or similar automation practices
  • 1+ year of experience with Kubernetes/OCP, containerization, and hybrid cloud platforms (AWS, Azure, GCP)
  • 2+ years of experience designing and consuming RESTful APIs and integrating automation into platform services

Nice To Haves

  • Deep understanding of server internals across one or more platforms (e.g., connectors, routing engines, thread pools, modules/plugins, classloading, cluster coordination, reverse proxy behavior)
  • Experience integrating application/web servers with load balancers, API gateways, and service meshes
  • Experience implementing enterprise features such as JNDI/JDBC (Tomcat/Liberty), reverse‑proxy modules (Apache/NGINX), or Liberty features/packs
  • Familiarity with designing scalable hosting architectures and deployment pipelines for Java and web applications
  • Experience with HA and DR patterns spanning multi‑AZ or multi‑region deployments across server platforms
  • Hands‑on experience with observability tools (Prometheus, Grafana, ELK) and platform‑specific monitoring interfaces (e.g., JMX, mod_status, NGINX stub_status, Liberty admin metrics)

Responsibilities

  • Lead large‑scale initiatives to automate provisioning, configuration, and lifecycle operations for application and web server platforms (Tomcat, Apache HTTPD, IBM Liberty, NGINX, etc.)
  • Architect and develop reusable IaC components (Ansible) for server installation, configuration management, clustering, routing, JVM tuning, certificate automation, and policy enforcement
  • Develop automation scripts and workflows using Python or Java to support provisioning, configuration, governance, certificate management, and operational efficiency
  • Develop robust APIs using Java Spring Boot or Python to expose provisioning, configuration, deployment governance, certificate management, and capacity automation workflows
  • Design and implement GitOps‑driven workflows to automate server configuration updates—such as routing rules, reverse proxy updates, JVM or container runtime changes, TLS rotation, module/plugin configuration, and environment policies
  • Build and maintain self‑service platform capabilities enabling developers to request server instances, deploy applications, configure routing, request certificates, manage JNDI/resources, and consume metrics through APIs or service catalogs
  • Collaborate across engineering, security, and product teams to ensure platform automation aligns with organizational goals and best practices
  • Participate in architecture and code reviews while mentoring engineers on server operations, IaC design patterns, and automation best practices
  • Continuously improve platform reliability, performance, scalability, and operational efficiency through automation modernization and engineering excellence

Benefits

  • Health benefits
  • 401(k) Plan
  • Paid time off
  • Disability benefits
  • Life insurance, critical illness insurance, and accident insurance
  • Parental leave
  • Critical caregiving leave
  • Discounts and savings
  • Commuter benefits
  • Tuition reimbursement
  • Scholarships for dependent children
  • Adoption reimbursement