DevOps Engineer [Talents Bench]

Metal Toad
Remote

About The Position

The Cloud Engineer position at Metal Toad requires experience in designing and maintaining infrastructure for high-availability, scalable, enterprise-grade applications. You will be part of a talented team that works on mission-critical applications.

Requirements

  • Read and agree to our Corporate Values Statement.
  • Believe in the company's mission: to help people.
  • Advanced to fluent English communication skills are essential for this role.
  • 4+ years of experience with Amazon Web Services (AWS). Additional experience with other cloud providers is a bonus.
  • Experience in Linux and/or Windows systems administration (at least one required).
  • Scripting (bash shell and Python preferred, PowerShell acceptable).
  • Knowledge of TCP/IP networking and HTTP protocols.
  • Experience with web accelerators, load balancers, reverse proxies, and CDNs.
  • Problem solver and willing to work in an agile/fast-paced environment.
  • Customer-oriented with good communication skills.
  • Willing to participate in a compensated 24/7 on-call rotation, approximately one shift per month.

Nice To Haves

  • AWS certifications, or willingness to get certified.
  • Interest in Generative AI technologies.

Responsibilities

  • Planning
      • Analyzing customer requirements for software components, system availability, security, and performance.
      • Designing and documenting complete cloud hosting systems, including capacity planning, software and instance type selection, allocation, and network design.
      • Estimating the costs of the recommended system design.
      • Building systems by executing installation, configuration, and testing of cloud resources.
      • Using automation and configuration management to ensure repeatability and traceability of changes.
  • Managed Services
      • Troubleshooting system hardware, software, networks, and operating systems.
      • Protecting the integrity and security of systems through proper use of controls and monitoring tools, and providing written evaluations and recommendations for ongoing improvement.
      • Maintaining system performance through system monitoring and analysis, performance tuning, and planning for future growth.
      • Designing and running load and stress tests, documenting outcomes, debugging infrastructure issues, and escalating documented application problems to the development team.
      • Maintaining internal systems and customer deployment documentation.
      • Partnering with project managers, technical consultants, software architects, and developers to validate infrastructure deliverables against the requirements and document all technical hand-offs.
      • Experience with Amazon Web Services (AWS).
      • Responding to support tickets and incidents promptly, in line with SLA commitments.
  • Expertise
      • Contributing to the definition of best practices, operational policies, and procedures.
      • Establishing, documenting, and testing disaster recovery procedures, documenting outcomes, and making recommendations for ongoing improvement.
      • Keeping job knowledge current by participating in educational opportunities, reading professional publications, maintaining personal networks, and participating in professional organizations.