Site Reliability Engineer

Cisco | Denver, CO
13d

About The Position

Splunk, a Cisco company, is building a safer and more resilient digital world with an end-to-end full stack platform made for a hybrid, multi-cloud world. Leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. Come help organizations be their best, while you reach new heights with a team that has your back.

Meet the Team

The Splunk TechOps organization runs Splunk Cloud, blending SRE, Systems Engineering, and Service Engineering disciplines across functional global teams. We are a team striving for operational awesomeness and trying to automate the world. We empower our customers to execute our vision of making machine data accessible, usable, and valuable to everyone. In this role, you will use your cloud experience to drive the growth of Splunk Cloud while working with major industry vendors.

Your Impact

As a TechOps SRE, you will help maintain, contribute to, and improve the next generation of our large-scale Cloud offering. You will be responsible for the infrastructure that powers Splunk’s cloud services, moving tasks from small one-off implementations to massive scale across thousands of machines. You will be a data-driven decision-maker, ensuring that we are alerted to issues before our customers notice and constantly seeking ways to automate manual processes. Candidates that are humble, hungry to learn, and skilled will thrive in this role.

Requirements

  • Operational experience at scale with hands-on experience with Linux operating systems, cloud architecture (AWS, GCP, Azure) and networking.
  • Experience with architecture, deployments, and networking in one or more major cloud vendors.
  • Development skills in at least one language, such as Python, Shell or Go.
  • Familiarity with basic programming concepts, including input sanitization and unit testing.
  • A data-driven approach to decision-making and a passion for monitoring and feedback loops.

Nice To Haves

  • Previous experience in roles such as Systems Administrator, Network Engineer, or DevOps Engineer.
  • Experience working in observability or infrastructure teams as a Splunk admin, including work with distributed systems at scale and observability tools.
  • Experience working on distributed systems and a passion for finding edge cases that appear at scale.
  • A strong mindset for automation and the ability to bring small tasks to large-scale implementation.
  • Experience working with open-source projects and a desire to contribute back to the community.
  • Ability to balance high-level engineering goals with a healthy work-life commitment.

Responsibilities

  • Own the maintenance and improvement of Splunk’s large-scale Cloud infrastructure.
  • Automate every manual process and tedious task encountered to optimize system performance.
  • Work across global teams to ensure operational excellence and system reliability.
  • Collaborate with peers from engineering, product management, and customer support in a stable, supportive environment.
  • Actively take responsibility for projects and contribute to a culture of honesty and open communication.
  • Benefit from mentorship opportunities and continuous professional development.

Benefits

  • U.S. employees are offered benefits, subject to Cisco’s plan eligibility rules, which include medical, dental and vision insurance, a 401(k) plan with a Cisco matching contribution, paid parental leave, short and long-term disability coverage, and basic life insurance.
  • Employees may be eligible to receive grants of Cisco restricted stock units, which vest following continued employment with Cisco for defined periods of time.
  • U.S. employees are eligible for paid time away as described below, subject to Cisco’s policies: 10 paid holidays per full calendar year, plus 1 floating holiday for non-exempt employees; 1 paid day off for the employee’s birthday; a paid year-end holiday shutdown; and 4 paid days off for personal wellness, as determined by Cisco.
  • Non-exempt employees receive 16 days of paid vacation time per full calendar year, accrued at a rate of 4.92 hours per pay period for full-time employees.
  • Exempt employees participate in Cisco’s flexible vacation time off program, which has no defined limit on how much vacation time eligible employees may use (subject to availability and some business limitations).
  • 80 hours of sick time off are provided on the hire date and each January 1st thereafter, and up to 80 hours of unused sick time may be carried forward from one calendar year to the next.
  • Additional paid time away may be requested to deal with critical or emergency issues for family members.
  • Optional 10 paid days per full calendar year to volunteer.
  • For non-sales roles, employees are also eligible to earn annual bonuses subject to Cisco’s policies.