Site Reliability Engineer

EnsonoDowners Grove, IL
12h$85,000 - $135,000

About The Position

At Ensono, our Purpose is to be a relentless ally, disrupting the status quo and unleashing our clients to Do Great Things ! We enable our clients to achieve key business outcomes that reshape how our world runs. As an expert technology adviser and managed service provider with cross-platform certifications, Ensono empowers our clients to keep up with continuous change and embrace innovation. We can Do Great Things because we have great Associates. The Ensono Core Values unify our diverse talents and are woven into how we do business. These five traits are the key to achieving our purpose. Honesty – Reliability – Curiosity – Collaboration - Passion About the role and what you'll be doing: We are seeking an experienced Site Reliability Engineer (SRE) with expertise in Infrastructure as Code tools like Terraform, core CI/CD tools such as Azure DevOps, and monitoring tools including DataDog and AWS CloudWatch. The ideal candidate will have commercial experience in technologies like Dotnet or Java, and be skilled in troubleshooting, incident resolution, and improving service and change management processes. Strong leadership in client-facing discussions and engagement with third-party suppliers is essential. An SRE Foundation certificate and a cloud provider associate-level certification are highly beneficial. Commercial experience and proficiency with industry standard: IAC tooling (Terraform preferably, or ARM/bicep and CloudFront) Core CI/CD Tooling (Azure DevOps, GitHub Actions or Gitlab) Monitoring Tooling (Splunk, NewRelic, Azure Monitor, AWS CloudWatch) Commercial experience in at least one core technology (Dotnet, Java, AI/Data Engineering, Golang) Troubleshooting issues and identifying systemic failings indicated by incidents/failures Implementing fixes Proposing solutions for reducing toil Providing leadership in the Incident resolution process, including creating and maintaining documentation, and providing key input to Post-mortem analysis Improving Service Requests and Change Management processes, both technically and through stakeholder management). Participate in the process for, and Proactively mitigate risks in a Security management process (Vulnerabilities in Code, Infrastructure, Dependencies) Lead discussion in client-facing meetings and discussions around the SRE process, and identifying areas for increasing SRE footprint. Engaging with suppliers and 3rd parties for support, requests and opportunities We want all new Associates to succeed in their roles at Ensono. That's why we've outlined the job requirements below. To be considered for this role, it's important that you meet all Required Qualifications. If you do not meet all of the Preferred Qualifications, we still encourage you to apply.

Requirements

  • 3-9 Years experience
  • Bachelor’s degree (or equivalent) in computer science or related discipline
  • SRE Foundation certificate (DevOps Institute) and a Cloud provider (AWS, Azure, GCP) 'associate'-level certification, or completed during the probationary period.
  • Proficiency in Azure and Kubernetes, with hands-on experience in managing and deploying applications.
  • Expertise in Infrastructure as Code (IaC) using Terraform for efficient and scalable infrastructure management.
  • Familiarity with Harness for continuous delivery and deployment processes.

Nice To Haves

  • Certified Kubernetes Administrator / Application Developer
  • Certified Azure DevOps Engineer
  • Experience with monitoring tools such as NewRelic or Splunk for effective system monitoring and alerting.
  • Strong programming skills in .Net, Java, or JavaScript for developing robust and scalable applications

Responsibilities

  • Troubleshooting issues and identifying systemic failings indicated by incidents/failures
  • Implementing fixes
  • Proposing solutions for reducing toil
  • Providing leadership in the Incident resolution process, including creating and maintaining documentation, and providing key input to Post-mortem analysis
  • Improving Service Requests and Change Management processes, both technically and through stakeholder management).
  • Participate in the process for, and Proactively mitigate risks in a Security management process (Vulnerabilities in Code, Infrastructure, Dependencies)
  • Lead discussion in client-facing meetings and discussions around the SRE process, and identifying areas for increasing SRE footprint.
  • Engaging with suppliers and 3rd parties for support, requests and opportunities

Benefits

  • Unlimited Paid Days Off
  • Three health plan options
  • 401k with company match
  • Eligibility for dental, vision, short and long-term disability, life and AD&D coverage, and flexible spending accounts
  • Family Forming Benefit including fertility coverage and adoption/surrogacy reimbursement
  • Paid childbearing and paternal leave
  • Education Reimbursement, Student Loan Assistance or 529 College Funding
  • Sabbatical leave
  • Wellness program
  • Flexible work schedule
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service