CoStar Suite - Senior Site Reliability Engineer

CoStar GroupWashington, DC
404d

About The Position

The Senior Site Reliability Engineer at Suite is responsible for enhancing the availability, reliability, performance, security, capacity, and efficiency of applications through best software engineering practices. This role involves setting coding standards, performing code reviews, coaching team members, and writing code to solve complex problems. The engineer will manage monitoring and logging systems, oversee data recovery processes, and ensure high availability mechanisms, collaborating closely with developers, DevOps, and security teams throughout the software development life cycle.

Requirements

  • Bachelor's Degree from an accredited university or college.
  • 7 or more years of experience with modern technologies such as React, Vue, TypeScript, NodeJS, C#, .Net, OpenSearch, Kafka, SQL, NoSQL.
  • Ability to debug, profile, optimize code, and automate routine tasks.
  • Experience in Agile/Scrum processes and Continuous delivery practices.
  • Experienced in unit, performance, and automation testing.
  • Strong understanding of business drivers for software development.
  • Self-motivated with systematic problem-solving skills and excellent communication.

Nice To Haves

  • Expertise in designing, analyzing, troubleshooting, and capacity planning for large-scale distributed systems.
  • Experience with frontend frameworks like React, Angular, or Vue.
  • Familiarity with tools such as SQL, Bash, PowerShell, TFS/Azure DevOps, and Git.
  • Experience with operating systems (Windows/Linux) and networking concepts.
  • Knowledge of cloud platforms like AWS or GCP, Python, Serverless, Docker, and Kubernetes.
  • Experience with log management and analytics tools like DataDog, Elasticsearch, and Grafana.

Responsibilities

  • Develop and provide operational support for full-stack software applications.
  • Collaborate with development operations staff to scale, monitor, and troubleshoot system infrastructure (on-premise & in the cloud).
  • Increase system resilience and serve large customer volumes with expert-level coding and change management skills.
  • Improve and write automation to enhance system diagnostic and debugging capabilities.
  • Collect operating system data and report performance metrics to stakeholders.
  • Troubleshoot and debug production issues as they arise.
  • Adhere to industry standard security best practices.

Benefits

  • Comprehensive healthcare coverage: Medical, Vision, Dental, Prescription Drug.
  • Life, legal, and supplementary insurance.
  • Virtual and in-person mental health counseling services.
  • Commuter and parking benefits.
  • 401(K) retirement plan with matching contributions.
  • Employee stock purchase plan.
  • Paid time off.
  • Tuition reimbursement.
  • On-site fitness center and/or reimbursed fitness center membership costs.
  • Access to Diversity, Equity, & Inclusion Employee Resource Groups.
  • Complimentary gourmet coffee, tea, and healthy snacks.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service