About The Position

Are you experienced in managing Windows and Linux systems with a passion for building highly-available infrastructure at massive scale? Join us in driving cloud innovation and automation, deploying mission-critical systems that power global operations. AWS Infrastructure Services designs, delivers, and operates the global infrastructure that keeps AWS running. We manage the critical data center components - from servers and networking to power and cooling - that ensure continuous service for our customers. Within this organization, the Controls Fleet team builds and supports critical infrastructure services that power the world's premier e-commerce and cloud computing environments. We are seeking a passionate and motivated Systems Engineer to join our team of Windows and Linux experts. In this role, you'll deploy infrastructure and build automated solutions across our growing environment. The ideal candidate will have strong experience in Windows Server administration and Linux systems, hands-on experience with automation and scripting (PowerShell, Python, or Bash), and solid understanding of networking and distributed systems. Strong communication and documentation skills are essential, as you'll be collaborating closely with global teams. We value candidates who solve complex technical challenges through root cause analysis, can adapt to a fast-paced environment, and effectively communicate technical concepts. You'll make an impact by deploying and managing critical infrastructure, creating automated solutions for large-scale deployments, collaborating with teams to improve system reliability, and implementing solutions to technical challenges. You will be working in a hyper-growth environment where priorities shift quickly. You must be flexible and adapt well to a wide range of tasks and technologies. At Amazon, it is expected that your technical knowledge demonstrates both depth and breadth. Leveraging the strengths of individual team members as peers and delegating tasks appropriately within the group for long term projects will all be critical tasks for this role. Deep knowledge of the domain and is sought after as a thought-leader across the organization. If you are passionate about technology, excited by the prospect of working in a dynamic, fast-paced environment, and driven to solve complex problems, we would love to hear from you. Minimal Travel Required. You are expected to be onsite at a minimum five days a week.

Requirements

  • Bachelor's degree in Systems Engineering, Computer Science, or related field or relevant work experience
  • 4+ years of site reliability engineering (SRE), systems engineering, systems administration, DevOps, security administration, or network administration experience
  • 2+ years of building scripts, tooling, and automation for large-scale computing environments experience
  • Experience in any of the following: Python, Java, Perl, PHP, Ruby, Bash, Shell or equivalent
  • Experience designing and developing scripts to automate operational burdens and reviewing scripting changes to ensure they meet the standards for maintainability, scalability and security
  • Experience working in 24/7 production environment
  • Experience with service-oriented architecture and web services

Responsibilities

  • Technical Infrastructure: Deploy, manage, and support large-scale Windows and Linux environments, including virtualization platforms and networking components
  • Automation & Development: Create and maintain automation solutions using PowerShell, Python, or Bash, focusing on scalable deployment processes and infrastructure management
  • Cloud Services: Work with AWS or similar cloud platforms to support hybrid infrastructure environments
  • System Design: Architect and implement secure, scalable solutions while considering system interdependencies and limitations
  • Problem Solving: Analyze complex technical issues and develop effective solutions through root cause analysis
  • Documentation & Training: Create and maintain technical documentation, develop training materials, and support team knowledge sharing
  • Collaboration: Work effectively with global teams, provide technical consultation, and support cross-functional projects
  • Security & Compliance: Identify security risks, develop mitigation plans, and participate in compliance efforts
  • Operational Support: Available for 24/7 on-call rotation and up to 20% travel

Benefits

  • Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave.
  • Learn more about our benefits at https://amazon.jobs/en/benefits/us-benefits-and-stock .
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service