About The Position

The Digital Modernization Sector has an opening for a Mid-Tier Tivoli Workload Scheduler/AWS Administrator to support a large healthcare contract. We are seeking a Mid-Tier AWS Administrator to support the day-to-day operations, security, and performance of our AWS cloud infrastructure while managing enterprise-level workload scheduling and automation through Tivoli/IBM Workload Scheduler (TWS/IWS). This role is hands-on and operationally focused, requiring strong troubleshooting skills, disciplined execution, and the ability to support business-critical batch and cloud-native workflows. The ideal candidate has solid AWS administration experience, a working knowledge of workload automation platforms, and the ability to operate effectively in a production enterprise environment, as well as experience in advanced workload automation design, configuration, and operational support. This role provides leadership for scheduling solutions and serves as the subject matter expert for workload automation technologies.

Core Purpose: Maintain the health, security, and performance of AWS environments while ensuring reliable execution of automated job scheduling workflows using TWS.

Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent professional experience. Additional years of experience may be substituted in lieu of a degree.
  • 8 years of experience in IT operations or infrastructure support roles.
  • At least 2 years of hands-on AWS administration experience.
  • Must be able to obtain and maintain a public trust clearance.
  • All candidates supporting the CMS programs must have lived in the United States for at least three (3) of the last five (5) years in order to be considered.
  • AWS Certified SysOps Administrator – Associate
  • Tivoli / IBM Workload Scheduler (TWS) Knowledge: Core Architecture. Understanding of the TWS distributed architecture, including:
      • Master Domain Manager (MDM): Central scheduling authority and database.
      • Dynamic Workload Console (DWC): Web-based interface for job design and monitoring.
      • Agents: Including Fault-Tolerant Agents (FTA) capable of running jobs during temporary MDM communication outages.
  • Experience supporting high-availability scheduling environments.
  • AWS Integration: Deploy and manage TWS agents on EC2 instances to execute scripts and applications. Support cloud-native workflow triggers using AWS Lambda or Step Functions integrated with TWS. Manage file-based dependencies leveraging Amazon S3. Familiarity with CI/CD automation pipelines.
  • Daily Operations & Maintenance: Create, modify, and maintain job definitions, calendars, and scheduling resources. Troubleshoot job failures and delays by analyzing logs and predecessor/successor relationships. Perform TWS patch upgrades and maintain integrations with enterprise monitoring tools such as IBM Tivoli Monitoring.

Nice To Haves

  • Technical Skills: Proficiency in Linux/Unix environments.
  • Strong scripting skills using Python and/or Bash.
  • Experience with automation and configuration tools such as Ansible or AWS CloudFormation.
  • Experience designing enterprise batch scheduling solutions.
  • Ability to analyze workload performance and optimize scheduling.
  • Strong operational discipline with attention to detail.
  • Ability to work effectively in a production support environment with on-call or after-hours responsibilities.
  • Clear communication skills and the ability to collaborate with infrastructure, application, and automation teams.
  • Ability to work well under pressure.

Responsibilities

  • AWS Infrastructure Management: Provision, configure, and maintain AWS resources including EC2, S3, IAM, and VPCs in alignment with the AWS Well-Architected Framework. Perform routine system maintenance, patching, and configuration updates across cloud environments.
  • Workload Automation & Scheduling: Operate and support the TWS/IWS platform, including job stream monitoring, dependency management, and agent health checks. Ensure reliable execution of daily and intraday production workloads.
  • Security & Compliance: Enforce least-privilege access controls using AWS IAM. Monitor and remediate security findings using tools such as AWS Security Hub. Support audit and compliance requirements related to cloud infrastructure and automation platforms.
  • Incident Response & Operations Support: Triage, diagnose, and resolve incidents involving AWS infrastructure and failed or delayed scheduling batches. Escalate complex issues appropriately and participate in root-cause analysis efforts.
  • Performance & Cost Optimization: Identify performance bottlenecks and inefficiencies related to cloud resource usage and job throughput. Implement auto-scaling, scheduling adjustments, or script improvements to improve performance and control costs.
  • Backup & Disaster Recovery: Maintain, test, and document backup and disaster recovery strategies for AWS resources and TWS databases. Participate in disaster recovery exercises and validate recovery procedures.

Benefits

  • Employment benefits include competitive compensation, health and wellness programs, income protection, paid leave, and retirement.