Amazon.com-posted 3 months ago
$116,300 - $201,200/Yr
Full-time • Mid Level
Culver City, CA
5,001-10,000 employees
General Merchandise Retailers

As a Site Reliability Engineer, you'll have end-to-end ownership of the product, user experience, design, and technology required to deliver state-of-the-art experiences for our Amazon MGM Studios customers. You'll get to work on projects that are fast-paced, challenging, and varied. You'll also be able to experiment with new possibilities, take risks, and collaborate with remarkable people. We'll look for you to bring your diverse perspectives, ideas, and skill-sets to make Prime Video even better for our customers. With global opportunities for talented technologists, you can decide where a career Prime Video Tech takes you! The Studios Technology team supports production personnel (cast and crew) and Studios business teams (production operations, post production, and marketing) and our Media Supply Chain. We work in a team environment and interact with production personnel and studio executives at all levels. Our Infrastructure Engineering team is looking for a Site Reliability Engineers to build, deploy, operate, and sustain our critical infrastructure and systems. The team will operationalize the stability and reliability of these systems and discover innovative ways to scale and operate them reliably as we expand. You will deploy and monitor the systems and automation to ensure that critical infrastructure is operating optimally and implement mechanisms to prevent service impacting incidents. You will utilize trends and metrics to identify opportunities for improvements within existing frameworks, tools and processes to continuously improve systems. Site Reliability Engineers focus on automating infrastructure at scale and using best practice stability and reliability protocols to ensure reliability and repeatability.

  • Deploy and monitor systems and automation to ensure critical infrastructure is operating optimally.
  • Implement mechanisms to prevent service impacting incidents.
  • Utilize trends and metrics to identify opportunities for improvements within existing frameworks, tools, and processes.
  • Automate infrastructure at scale using best practice stability and reliability protocols.
  • Collaborate with software development teams to optimize configurations and ensure systems functionality and performance.
  • Experience building services using AWS products.
  • Experience in automating, deploying, and supporting large-scale infrastructure.
  • Experience programming with at least one modern language such as Python, Ruby, Golang, Java, C++, C#, or Rust.
  • Experience with Linux/Unix.
  • Experience with CI/CD pipelines build processes.
  • Experience with distributed systems at scale.
  • Medical, financial, and/or other benefits.
  • Equity and sign-on payments as part of total compensation package.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service