About The Position

The Cloud & Infrastructure Operations Engineer is responsible for managing and supporting enterprise infrastructure across AWS, OCI, and on‑premises environments. This role focuses on server operations, cloud services administration, backup and recovery platforms, security compliance, and infrastructure reliability. The position works closely with application teams, DBAs, cloud engineers, and infrastructure stakeholders to ensure the availability, security, compliance, and recoverability of systems.

Requirements

  • 2+ years of experience in an Infrastructure Operations Engineer position
  • Hands‑on experience managing AWS, OCI, and on‑prem infrastructure environments
  • Strong knowledge of Windows server administration
  • Experience with VMware virtualization platforms
  • Proficiency with cloud monitoring and logging tools (AWS CloudWatch, CloudTrail, SolarWinds, OCI monitoring)
  • Experience with patch management, vulnerability management, and security hardening
  • Hands‑on experience with enterprise backup platforms
  • Knowledge of backup SLAs, compliance requirements, and audit support
  • Strong troubleshooting, documentation, and cross‑team collaboration skills
  • Bachelor’s degree in computer science or equivalent

Nice To Haves

  • Experience supporting hybrid cloud and multi‑cloud environments.
  • Familiarity with IAM, MFA, and role‑based access controls across cloud and backup platforms
  • Experience with cost optimization and cloud fiscal management
  • Experience supporting Linux‑based systems.
  • Experience supporting database platforms and coordinating SQL Server upgrades.

Responsibilities

  • Manage AWS, OCI, and on-premises server environments, including virtualized and physical systems.
  • Administer server operating systems, security configurations, and VMware platforms.
  • Perform device and console administration across cloud platforms.
  • Manage object storage, volume storage, and file system storage.
  • Support cloud networking, endpoints, and site-to-site VPN connectivity.
  • Implement and maintain monitoring and reporting using AWS CloudWatch, CloudTrail, SolarWinds, and OCI monitoring tools.
  • Track system health, performance, and availability.
  • Perform quarterly cloud cost analysis and reporting.
  • Execute monthly patching for Microsoft-based servers.
  • Manage post-patching services and validation activities.
  • Implement IAM controls and server hardening standards.
  • Manage endpoint protection solutions with tools such as CrowdStrike and Qualys.
  • Conduct vulnerability assessment and vulnerability management activities.
  • Ensure systems meet internal security and compliance requirements.
  • Configure and maintain backup & replication servers, repositories, and jobs.
  • Perform on-demand restores for files, virtual machines, databases, and applications.
  • Validate recoverability through regular test restore drills.
  • Monitor daily backup, replication, archival, and snapshot jobs.
  • Generate backup health and status reports for stakeholders.
  • Ensure automated backups meet defined SLA and compliance requirements.
  • Manage backup SLA policies, archival tiers, and cloud integrations.
  • Monitor backup storage utilization across repositories and cloud archival tiers.
  • Support audits and maintain backup compliance documentation.
  • Coordinate monthly and quarterly refresh cycles across environments.
  • Coordinate upgrades of applications running on servers.
  • Partner with application teams, DBAs, cloud teams, and infrastructure teams.
  • Gather and understand backup and recovery requirements for new applications and infrastructure updates.
  • Participate in change management processes and produce required documentation.