About The Position

We are seeking an AWS Administrator to support the day-to-day operations, security, and performance of our AWS cloud infrastructure while managing enterprise-level workload scheduling and automation through Tivoli/IBM Workload Scheduler (TWS/IWS). This role is hands-on and operationally focused, requiring strong troubleshooting skills, disciplined execution, and the ability to support business-critical batch and cloud-native workflows. The ideal candidate has strong AWS administration experience, a clear grasp of workload automation platforms, and the ability to operate effectively in a production enterprise environment with minimal supervision. Core Purpose Maintain the health, security, and performance of AWS environments while ensuring reliable execution of automated job scheduling workflows using TWS.

Requirements

  • Ability to obtain and maintain Public Trust
  • All candidates supporting the CMS programs must have lived in the United States at least three (3) out of the last five (5) years prior in order to be considered.
  • Bachelor’s degree in computer science, Information Technology, or a related field, or equivalent professional experience.
  • 4+ years of experience in IT operations or infrastructure support roles.
  • At least 1 years of hands-on AWS administration experience with limited supervision.

Nice To Haves

  • Proficiency in Linux/Unix environments.
  • Strong scripting skills using Python and/or Bash.
  • Experience with automation and configuration tools such as Ansible or AWS CloudFormation.
  • AWS Certified SysOps Administrator – Associate (Encouraged)
  • Tivoli / IBM Workload Scheduler (TWS) Knowledge Core Architecture Understanding of TWS distributed architecture, including: Master Domain Manager (MDM): Central scheduling authority and database. Dynamic Workload Console (DWC): Web-based interface for job design and monitoring. Agents: Including Fault-Tolerant Agents (FTA) capable of running jobs during temporary MDM communication outages.
  • AWS Integration Deploy and manage TWS agents on EC2 instances to execute scripts and applications. Support cloud-native workflow triggers using AWS Lambda or Step Functions integrated with TWS. Manage file-based dependencies leveraging Amazon S3.
  • Daily Operations & Maintenance Create, modify, and maintain job definitions, calendars, and scheduling resources. Troubleshoot job failures and delays by analyzing logs and predecessor/successor relationships. Perform TWS patch upgrades and maintain integrations with enterprise monitoring tools such as IBM Tivoli Monitoring.
  • Clear operational discipline with attention to detail.
  • Ability to work effectively in a production support environment with on-call or after-hours responsibilities.
  • Clear communication skills and the ability to collaborate with infrastructure, application, and automation teams.

Responsibilities

  • AWS Infrastructure Management Provision, configure, and maintain AWS resources including EC2, S3, IAM, and VPCs in alignment with the AWS Well-Architected Framework. Perform routine system maintenance, patching, and configuration updates across cloud environments.
  • Workload Automation & Scheduling Operate and support the TWS/IWS platform, including job stream monitoring, dependency management, and agent health checks. Ensure reliable execution of daily and intraday production workloads.
  • Security & Compliance Enforce least-privilege access controls using AWS IAM. Monitor and remediate security findings using tools such as AWS Security Hub. Support audit and compliance requirements related to cloud infrastructure and automation platforms.
  • Incident Response & Operations Support Triage, diagnose, and resolve incidents involving AWS infrastructure and failed or delayed scheduling batches. Escalate complex issues appropriately and participate in root-cause analysis efforts.
  • Performance & Cost Optimization Identify performance bottlenecks and inefficiencies related to cloud resource usage and job throughput. Implement auto-scaling, scheduling adjustments, or script improvements to improve performance and control costs.
  • Backup & Disaster Recovery Maintain, test, and document backup and disaster recovery strategies for AWS resources and TWS databases. Participate in disaster recovery exercises and validate recovery procedures.

Benefits

  • Employment benefits include competitive compensation, Health and Wellness programs, Income Protection, Paid Leave and Retirement.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service