Cloud Operations Analyst (Contract To Hire, FTE)

TEKsystemsFarmington Hills, MI
3d$40 - $60Hybrid

About The Position

The Cloud Operations Analyst is responsible for leading the management, optimization, and automation of cloud and on-premises infrastructure to ensure seamless operations and business continuity. This role includes driving improvements in observability, server and batch operations, and data center management while proactively identifying and resolving performance and reliability issues. The Cloud Operations Analyst provides technical leadership, mentors team members, and consults with cross-functional teams to enhance operational excellence through best practices, process enhancements, and cutting-edge technologies.

Requirements

  • Enterprise Windows-based systems administration - 5 plus years
  • Cloud Experience, AWS preferred- 3 plus years

Responsibilities

  • Independently develop, implement, and maintain observability tools to monitor cloud and on-premises systems.
  • Actively support infrastructure teams in the management and maintenance of server systems running on Windows and Linux.
  • Create dashboards, alerts, and reports to track system health, performance, and availability.
  • Analyze metrics and logs to identify trends, prevent potential issues, and optimize system performance.
  • Act as the lead consultant with FinOps teams to monitor resource utilization and ensure cost-effective operations across cloud environments.
  • Manage the lifecycle of cloud and on-premises servers, including provisioning, patching, configuration, and decommissioning.
  • Troubleshoot and resolve server-related issues, ensuring minimal downtime. Implement and enforce server security policies and compliance requirements.
  • Schedule, monitor, and manage batch processes to ensure timely execution of critical tasks.
  • Identify and resolve batch failures or delays, coordinating with relevant teams to ensure smooth operations.
  • Building new batch jobs for improved performance and resource utilization.
  • Lead on-site and remote data center operations, ensuring proper functioning of hardware, power, cooling, and network infrastructure.
  • Coordinate with vendors and service providers for hardware maintenance, replacements, and upgrades.
  • Participate in on-call rotations to address system incidents and outages promptly.
  • Conduct root cause analysis and implement solutions to prevent recurrence of issues.
  • Document and communicate incident resolution processes to relevant stakeholders.
  • Work closely with cross-functional teams, including DevOps, Networking, and Application Development, to implement and maintain system integrations.
  • Maintain comprehensive documentation for configurations, processes, and incident resolutions.
  • Provide training and support to team members and other departments.

Benefits

  • Medical, dental & vision
  • Critical Illness, Accident, and Hospital
  • 401(k) Retirement Plan – Pre-tax and Roth post-tax contributions available
  • Life Insurance (Voluntary Life & AD&D for the employee and dependents)
  • Short and long-term disability
  • Health Spending Account (HSA)
  • Transportation benefits
  • Employee Assistance Program
  • Time Off/Leave (PTO, Vacation or Sick Leave)
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service