Site Reliability Engineer

iCIMS · Holmdel, NJ
127d · $100,000 - $140,000 · Hybrid

About The Position

We are seeking a skilled Site Reliability Engineer (SRE) to contribute to the reliability, scalability, and performance of our multi-cloud SaaS platform, which serves thousands of customers worldwide. The role involves hands-on technical work in incident response, system monitoring, automation, and continuous improvement of platform reliability. The successful candidate will work within a global SRE team to ensure optimal system performance and customer satisfaction.

Requirements

  • 4+ years of experience in SRE, DevOps, or Infrastructure Engineering
  • Hands-on experience with AWS (required) and Azure (preferred)
  • Strong Linux system administration skills
  • Experience with monitoring tools (New Relic, Grafana, Prometheus)
  • Scripting skills in Python, Bash, or similar
  • Knowledge of databases (SQL Server, PostgreSQL, MongoDB)

Nice To Haves

  • SaaS experience in a global environment
  • Authentication and identity management systems knowledge
  • Cloud certifications (AWS, Azure, or Google Cloud)
  • Infrastructure-as-code tools (Terraform, CloudFormation)

Responsibilities

  • Monitor multi-cloud infrastructure (AWS, Azure, GCP) using New Relic, Grafana, and Sumo Logic
  • Maintain reliability of AWS resources, Auth0/Okta authentication, databases, and legacy applications
  • Implement monitoring, alerting, and dashboards for assigned systems
  • Respond to alerts and incidents within SLA timeframes
  • Perform root cause analysis and document findings
  • Create and maintain runbooks and troubleshooting procedures
  • Participate in 24/7 on-call rotation
  • Develop scripts to reduce manual operational overhead
  • Build monitoring and alerting solutions
  • Support infrastructure-as-code initiatives
  • Implement automated remediation where possible
  • Contribute to team knowledge base and mentor junior engineers

Benefits

  • Medical, dental, and vision insurance
  • 401(k) plan
  • Dependent care benefits
  • Short-term and long-term disability insurance
  • Life and AD&D insurance
  • Bonding and parental leave
  • Mindfulness resources
  • Open vacation policy
  • Sick days
  • Paid holidays
  • Quiet hours each workday
  • Tuition reimbursement

What This Job Offers

  • Job Type: Full-time
  • Career Level: Mid Level
  • Industry: Professional, Scientific, and Technical Services
  • Education Level: Bachelor's degree
