About The Position

AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help. You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion. Do you like helping U.S. Intelligence Community agencies implement innovative cloud computing solutions and solve technical problems? Would you like to do this using the latest cloud computing technologies? Do you have a knack for helping these groups understand application architectures and integration approaches, and the consultative and leadership skills to launch a project on a trajectory to success? Amazon is seeking a Linux/Unix Systems Administrator/Engineer with the ability to automate day to day tasks and develop/build software and/or services from the ground up. A good candidate must have strong Linux/Unix Systems Administration knowledge, including shell scripting, and a proficiency in at least one development language. They must be able to think at “Amazon scale” to solve problems in permanent, sustainable, and scalable ways. With an eye toward utilizing the best solution over the quickest solution. At Amazon scale, Network Engineers rely on an ever increasing number of tools to manage thousands of network devices that support AWS services. Our Systems Team manages the hundreds of tools/services and components that the Network Engineers rely on to keep the network operational. This includes systems that track loss and incident correlation, scaling and building of new and existing network devices, and a full suite of monitoring tools. Many of these tools provided integrations with one another to give a broad to in depth view of the status of the network and aid in troubleshooting. This position requires that the candidate selected be a US Citizen and must currently possess and maintain an active TS/SCI security clearance with polygraph.

Requirements

  • Must be able to work in a 24x7 team on call rotation, with ability to drive into workplace for critical events/needs.
  • The ability to sit in front of computer during scheduled work hours with appropriate breaks while maintaining a high level of alertness and attention to detail.
  • Travel to data center/systems sites and Amazon/customer offices as needed.
  • Experience dealing with customers during problem resolution and operating efficiently under pressure.
  • 3+ years of non-internship professional software development experience
  • 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
  • Experience programming with at least one software programming language
  • Current, active US Government Security Clearance of TS/SCI with Polygraph
  • 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
  • Bachelor's degree in computer science or equivalent

Responsibilities

  • The Systems Development Engineers on the team are responsible for maintaining the network tools/systems described above within US GovCloud and other US Government air gapped regions.
  • This includes troubleshooting problems with systems and services, regular deployment of new versions of the systems and their subcomponents, deployment/system validation and testing, service monitoring, standing up new services/tools, etc.
  • The team works with many different internal Software Development teams to drive improvement of the systems/services within the team's scope.
  • It is important to be able to work collaboratively and independently to investigate and document issues and create solutions to solve them at scale.
  • Calmly and quickly diagnose and fix critical systems failures in high pressure situations?
  • Manage and grow innovative, production-quality tools to solve real operational problems, in Python, Perl, Ruby, Shell, Java, etc.?
  • Investigate complicated technical issues scientifically and thoroughly, and assist in fixing them so they don't come back?
  • Understand how a modern, cloud-hosted application stack works from top to bottom?
  • Know how to provide technical solutions to real business problems in a global organization?
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service