About The Position

We are seeking an experienced Senior Systems Engineer to lead the development of automation software for our storage infrastructure. You will work across multiple teams and organizations to build scalable, reliable systems that power our distributed storage platform.

You will lead the Hardware Engineering (HWEng) System Development (SysDE) effort to define and build software and enabling tools, and track and report progress. You will work across internal HWEng teams to ensure that new storage hardware addresses the data path and control path functionality needed by the EC2, EBS, and S3 teams, and work closely with internal customers to identify early any potential problems with onboarding new storage servers into their ecosystem. You will build, manage, and deploy pipelines for rapid deployment of new code changes to a variety of org-owned and customer-owned systems, and build monitoring tools and metrics to ensure hardware is running properly in both lab and production environments.

System Development Engineers in AWS Hardware Engineering wear many hats. From orchestration tooling development to hardware integration to kernel driver debugging, we dive deep into problems across the breadth of AWS. The ideal candidate will have a proven track record of rapidly coming up to speed on new engineering disciplines and making impactful decisions within that space, along with experience gluing together components written by more specialized engineers to create a cohesive, well-running engineering product.

Requirements

  • 5+ years of programming experience with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, or Ruby
  • 5+ years of non-internship professional software development experience
  • 5+ years of experience designing or architecting new and existing systems (design patterns, reliability, and scaling)
  • 5+ years of systems development experience in an IT or data center environment
  • 5+ years of experience deploying and operating in a Linux/Unix environment
  • 5+ years of systems design, software development, operations, automation, and process improvement experience
  • Experience leading the design, build and deployment of complex and performant (reliable and scalable) software solutions in production

Nice To Haves

  • Knowledge of engineering practices and patterns for the full software, hardware, and network development life cycle, including coding standards, code reviews, source control management, build processes, testing, certification, and live-site operations
  • Experience taking a leading role in building complex software or computing infrastructure that has been successfully delivered to customers
  • 6+ years of development, programming, or scripting experience (Python, Java, Bash, Perl)
  • 10+ years of experience designing or architecting new and existing systems (design patterns, reliability, and scaling)
  • Knowledge of AWS services, such as Elastic Compute Cloud (EC2), Elastic Block Store (EBS), and Simple Storage Service (S3)
  • Experience in compute and storage server architecture and design for large scale applications
  • Experience in Unix, or experience scripting in modern programming languages

Responsibilities

  • Lead the development of automation software across teams
  • Design and build scalable system-level software with focus on durability, availability, security, and diagnostics
  • Develop and maintain device drivers for Linux on ARM and x86 architectures
  • Debug and resolve Linux boot and runtime issues across different processor architectures
  • Build automation solutions using modern programming languages (Python, Ruby, Java, etc.)
  • Work with OS internals and storage subsystems in Linux-based environments
  • Collaborate with teams working on distributed storage systems and storage hardware
  • Troubleshoot complex system-level issues in production environments
  • Help design servers and make them more robust

Benefits

  • Equity
  • Sign-on payments
  • Medical
  • Financial