Site Reliability Engineer II

RemitlyEvanston, IL
4d$71,600 - $119,400

About The Position

This position will resolve incidents and collate data in support of root cause analysis and systems design

Requirements

  • Programming & Scripting: Python, Bash scripting, Java, Angular
  • Cloud Platforms: AWS (EC2, S3, Lambda, Glue), Azure (Functions, Logic Apps, AKS), GCP (GKE, Cloud Functions)
  • Infrastructure as Code: Terraform, Ansible, Chef, Puppet
  • Containerization & Orchestration: Docker, Kubernetes
  • CI/CD & Automation: Jenkins, GitHub Actions, Bitbucket, GitLab
  • Monitoring & Observability: Prometheus, Grafana, DataDog, Dynatrace, Splunk, SignalFx
  • Networking & Security: AWS: VPCs, IAM, Transit Gateway, CloudWAN, route53, AWS KMS, RDS Azure: Application Gateway, VNET, Express route, private link, Azure firewall, MS Sentinel, Azure Entra ID, RBAC

Responsibilities

  • Monitoring & Observability: Create and optimize monitoring queries; establish service level baselines.
  • Incident Response: Support senior engineers during incidents; contribute to post-incident reviews.
  • Disaster Recovery: Participate in and help execute disaster recovery tests.
  • Automation & Infrastructure as Code: Implement automation and execute code in production environments.
  • Documentation: Contribute to SRE knowledge bases and documentation.
  • Collaboration: Work with cross-functional teams including Development, QA, IT Operations, and Product SRE.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service