AppOps (Application Operations) Engineer

FiservLincoln, NE
1dOnsite

About The Position

About your role: You will operate and improve the cloud-native application platform that supports our Digital Consumer Experience products, working closely with engineering, SRE, and support teams to maintain availability and performance. You will focus on AWS services and automation to scale infrastructure reliably, respond to production incidents, and drive continuous improvement in deployment speed and operational consistency.

Requirements

  • 6+ years of experience supporting cloud-native production applications and infrastructure in AWS, including EKS, EC2, VPC, RDS, S3, KMS, and networking components such as Subnets, Route Tables, and Security Groups.
  • 6+ years of experience implementing and maintaining Infrastructure as Code using Terraform or AWS CloudFormation to provision, configure, and scale cloud environments.
  • 4+ years of experience operating container platforms and managing Kubernetes clusters, including upgrades, scaling, and troubleshooting.
  • 3+ years of experience using observability and monitoring tools such as Splunk, Dynatrace, and Amazon CloudWatch to detect and diagnose production issues.
  • 2+ years participating in a 24x7 on-call rotation supporting production services and responding to incidents.
  • AWS certification (Associate or Professional) or equivalent certification preferred, or equivalent combination of certification, training, and/or on-the-job experience.
  • Bachelor's degree or Master's degree in Computer Science, Information Technology, or a related field, or equivalent combination of education, related experience and/or military experience.

Nice To Haves

  • Experience with container tooling such as Docker or Podman and image build pipelines.
  • Experience with CI/CD platforms such as AWS CodePipeline, Azure DevOps, or GitLab CI.
  • Experience configuring and managing web servers and reverse proxies (Nginx, Apache).
  • Familiarity with ServiceNow, JIRA, Confluence, and incident management workflows.

Responsibilities

  • Provide hands-on support for production and non-production environments, including software installation and upgrades, patch deployment, configuration management, and system tuning.
  • Manage and upgrade Amazon Elastic Kubernetes Service (EKS) clusters, including cluster lifecycle, scaling, and troubleshooting.
  • Develop, maintain, and evolve Infrastructure as Code (IaC) and automation for provisioning, CI/CD, deployments, and configuration management.
  • Implement and maintain application and infrastructure monitoring, alerting, and incident response processes; act as a hands-on technical resource during production incidents.
  • Collaborate with engineering teams to integrate key performance indicators (KPIs) into the monitoring framework and recommend capacity and performance improvements.
  • Participate in release deployments, disaster recovery planning, capacity planning, and performance tuning to ensure operational continuity.
  • Support a 24×7 environment by resolving issues with minimal customer impact and participating in an on-call rotation.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service