About The Position

Are you experienced in managing Windows, Linux, and Cloud systems, with a passion for building highly available and secure infrastructure at massive scale? Join us in driving cloud innovation and automation, deploying mission-critical systems that power AWS' global operations.

AWS Infrastructure Services designs, delivers, and operates the global infrastructure that keeps AWS running. We manage the critical data center components - from servers and networking to power and cooling - that ensure continuous service for our customers. Within this organization, the Controls Fleet team builds and supports critical infrastructure services that power the world's premier e-commerce, cloud computing, and AI/ML environments.

We are seeking a passionate and motivated Sr. Systems Engineer to join our team of Windows, Linux, and AWS Cloud experts. In this role, you'll deploy highly secure infrastructure and build automated solutions across our growing environment. The ideal candidate will have strong experience in Cyber Security, Systems Engineering, and DevOps for Windows and Linux systems, hands-on experience with automation and scripting (PowerShell, CDK, Python, or Bash), and a solid understanding of networking and distributed systems. Strong communication and documentation skills are essential, as you'll be collaborating closely with global teams.

We value candidates who solve complex technical challenges through root cause analysis, can adapt to a fast-paced environment, and effectively communicate technical concepts. You'll make an impact by deploying and managing critical infrastructure, creating automated solutions for large-scale deployments, collaborating with teams to improve system reliability, and implementing solutions to technical challenges. You will be working in a hyper-growth environment where priorities shift quickly. You must be flexible and adapt well to a wide range of tasks and technologies.

At Amazon, your technical knowledge is expected to demonstrate both depth and breadth. Leveraging the strengths of individual team members as peers and delegating tasks appropriately within the group for long-term projects will both be critical to this role. The ideal candidate has deep knowledge of the domain and is sought after as a thought leader across the organization. If you are passionate about technology, excited by the prospect of working in a dynamic, fast-paced environment, and driven to solve complex problems, we would love to hear from you.

Minimal travel required. You are expected to be onsite a minimum of five days a week.

Requirements

  • 4+ years of site reliability engineering (SRE), systems engineering, systems administration, DevOps, security administration, or network administration experience
  • 5+ years of Linux experience
  • 5+ years of systems engineering experience
  • Bachelor's degree in Systems Engineering, Computer Science, or a related field, or relevant work experience
  • Experience in any of the following: Python, Java, Perl, PHP, Ruby, Bash, Shell or equivalent
  • Knowledge of TCP/IP and networking protocols such as HTTP and DNS
  • Experience designing and developing scripts to automate operational burdens and reviewing scripting changes to ensure they meet the standards for maintainability, scalability and security
  • Experience working in a 24/7 production environment
  • Experience with service-oriented architecture and web services

Responsibilities

  • Security & Compliance: Identify security risks, develop at-scale mitigation plans, and participate in compliance efforts
  • Technical Infrastructure: Deploy, manage, and support large-scale Windows and Linux environments, including virtualization platforms and networking components
  • Automation & Development: Create and maintain automation solutions using PowerShell, Python, CDK, CloudFormation, or Bash, focusing on highly secure and scalable deployment and infrastructure management processes
  • Cloud Services: Work with AWS or similar cloud platforms to support hybrid infrastructure environments
  • System Design: Architect and implement secure, scalable solutions while considering system interdependencies and limitations
  • Problem Solving: Analyze complex technical issues and develop effective solutions through root cause analysis
  • Documentation & Training: Create and maintain technical documentation, develop training materials, and support team knowledge sharing
  • Collaboration: Work effectively with global teams, provide technical consultation, and support cross-functional projects
  • Operational Support: Participate in a 24/7 on-call rotation and travel up to 20% of the time

Benefits

  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave

What This Job Offers

  • Job Type: Full-time
  • Career Level: Mid Level
  • Number of Employees: 5,001-10,000
