This job is closed

We regret to inform you that the job you were interested in has been closed. Although this specific position is no longer available, we encourage you to continue exploring other opportunities on our job board.

Cisco Systemsposted 15 days ago
Full-time • Mid Level
Hybrid • Cary, NC
Professional, Scientific, and Technical Services
Resume Match Score

About the position

We are a software engineering team building platforms and tools that streamline infrastructure and platform service delivery, improve reliability, and enable the automation of IT operational functions at an extensive scale. Our customers include engineers from various engineering business units and enterprise application teams that rely on our IT infrastructure and platform services to run the business. We operate in a DevOps model, where our developers are responsible for the complete software development lifecycle, from design through operations. While we work closely with infrastructure, solving problems through software development is at our core. This role offers a superb opportunity to work with a distributed team to transform how infrastructure and cloud platforms are developed and managed using software development, AI, and automation.

Responsibilities

  • Analyze existing systems and identify areas for improvement in terms of reliability, performance, and automation.
  • Develop and implement automation solutions to reduce toil and improve operational efficiency.
  • Collaborate with software engineers to design and implement highly resilient and scalable architectures.
  • Define and monitor service level indicators (SLIs) and service level objectives (SLOs) using the team's observability and service assurance tooling.
  • Participate in operational support and responding to incidents in a timely and effective manner.
  • Participate in blameless postmortems and implementing preventative measures.
  • Implement and enforce security best practices, policies, and procedures to ensure a high degree of security hygiene.
  • Drive adoption and education of SRE standard methodologies within our team.

Requirements

  • Bachelor's degree in computer science, computer engineering, electrical engineering or equivalent is required with minimum 5 years of experience in an SRE, DevOps or related role.
  • Minimum 2 years of programming skills in Go.
  • Minimum 2 years of experience using configuration management tools, such as Terraform or Ansible.
  • Proficiency in containerization technologies, demonstrated by at least 2 years of experience working with Docker and Kubernetes.
  • Strong understanding of service level indicators (SLIs) and service level objectives (SLOs), with practical experience in defining and measuring these metrics in a production environment.

Nice-to-haves

  • MS degree preferred.
  • Experience with Python.
  • Familiarity with cloud platforms such as AWS, Azure, or GCP.
  • Knowledge with virtualization platforms such as VMware, Nutanix, OpenStack, Anthos, OpenShift.
  • Working knowledge of observability tools such as Prometheus, Grafana, Splunk, and Zabbix.
  • Practical experience with scrum agile development methodologies.
  • Experience supporting business-critical enterprise applications.
  • Experience with workflow orchestration tools (e.g., Stackstorm, Argo Workflows).

Benefits

  • Quality medical, dental and vision insurance.
  • 401(k) plan with a Cisco matching contribution.
  • Short and long-term disability coverage.
  • Basic life insurance.
  • Numerous wellbeing offerings.
  • Up to twelve paid holidays per calendar year.
  • Paid time off for volunteering (80 hours each year).
  • Flexible Vacation Time Off policy for exempt new hires.
  • Sick Time Off policy with 80 hours provided on hire date and annually thereafter.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service