Site Reliability Developer 3

OracleNashville, TN
6d

About The Position

Are you interested in building large-scale distributed infrastructure systems for the cloud? The OCI team is building new technologies that operate at high scale in a broadly distributed, multi-tenant cloud environment. Our customers depend on our work to run their businesses and our mission is to provide them with the best in class set of cloud-based services. As an ideal Site Reliability Development candidate you will have a strong understanding of Linux/Unix fundamentals and a rock solid knowledge of the full software build and deployment process (CI/CD). You will combine this knowledge with your hands on experience in software development - you will champion creating an engineering environment that embodies the best development and testing practices for delivering high quality services. You will be focusing on OCIs Region Build technology to design, research, and build creative solutions to extend Region Builds across the globe for OCI. These are exciting times in our space - we are growing fast; we are still at an early stage where an individual can have a significant impact. We are working on ambitious new initiatives. If you are passionate about taking ownership of big technical challenges and producing software solutions that have broad, significant impacts - come join our team!

Requirements

  • Bachelor’s degree in Computer Science or equivalent proven experience
  • 5+ years' experience building and operating large scale, highly available, cloud based distributed systems
  • Solid Python and terraform expertise is a must
  • Experience with Kubernetes and CI/CD pipelines is preferred
  • Validated understanding of operating system fundamentals
  • Thorough understanding of the latest security principles, techniques, and protocols
  • Strong troubleshooting and performance tuning skills
  • Knowledge of professional software engineering standard methodologies for the full software development process (CI/CD)
  • Experience building and operating scalable infrastructure software or distributed systems
  • Experience with cloud services and concepts Amazon/Azure.
  • Proven track record to achieve stretch goals in a highly innovative and fast-paced environment
  • Passion for technical leadership and mentoring
  • Strong verbal and written communication skills
  • Strong analytical skills, with excellent problem solving abilities
  • Specialist skills in a modern programming language such as Java, C, C++, C#, Go, or Python, with proficiency in additional languages preferred
  • Experience in Agile/SCRUM enterprise-scale software development
  • Direct experience with fleet orchestration for both virtual and containerized workloads

Responsibilities

  • In this role the engineer will be supporting our Region Build Automation efforts.
  • This engineer will work independently on projects with minimal oversight.
  • Work is typically focused on improving availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning for entire services and ecosystems.
  • This engineer will have ownership of several small projects and services.
  • Modifies architecture of an existing subsystem or service to improve reliability, availability, and performance.
  • They will also collaborate on architectural design reviews and changes.
  • Modification and improvement to multiple component pipelines, deployments, validations.
  • Own and improve metrics, KPIs, SLOs and visualizations for a system.
  • Automating and mitigating sources of operational toil.

Benefits

  • Medical, dental, and vision insurance, including expert medical opinion
  • Short term disability and long term disability
  • Life insurance and AD&D
  • Supplemental life insurance (Employee/Spouse/Child)
  • Health care and dependent care Flexible Spending Accounts
  • Pre-tax commuter and parking benefits
  • 401(k) Savings and Investment Plan with company match
  • Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
  • 11 paid holidays
  • Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
  • Paid parental leave
  • Adoption assistance
  • Employee Stock Purchase Plan
  • Financial planning and group legal
  • Voluntary benefits including auto, homeowner and pet insurance
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service