Site Reliability Engineer, Principal

Parsons Corporation
2dRemote

About The Position

Parsons is looking for an amazingly talented Senior Site Reliability Engineer to join our team! In this role, you will play a crucial role in ensuring the reliability, scalability, and performance of our systems. This position involves working closely with development and operations teams to build and maintain robust infrastructure, automate processes, and enhance system reliability. You will work closely with our partners as well as data engineers, analysts, and other stakeholders to ensure that our data-driven strategies are effective and aligned with our organizational goals. This role supports remote work, with occasional onsite attendance required for customer-facing meetings in Columbia, MD.

Requirements

  • Active Top Secret clearance
  • Bachelor’s degree from an accredited college or university or equivalent experience
  • A minimum of 6-10 years of experience in a DevOps or related role
  • Current Security+ certification
  • Proficient in Linux/Unix systems administration, cloud computing (AWS, Azure, GCP), automation (Python, Go, Bash, Ansible, Terraform), monitoring and alerting (Prometheus, Grafana, Datadog), and CI/CD pipelines
  • Experience with incident management, performance tuning, capacity planning, and security best practices
  • Strong problem-solving skills and attention to detail
  • Must possess excellent communication and a willingness to continuously learn and adapt to the ever-evolving technology landscape and collaboration skills, with the ability to work effectively in a team environment

Nice To Haves

  • Knowledge of database management and optimization
  • Previous experience in a DevOps or Agile environment

Responsibilities

  • Design, implement, and maintain scalable and reliable infrastructure solutions
  • Develop and manage monitoring and alerting systems to ensure system health and performance
  • Collaborate with development teams to improve system architecture and deployment processes
  • Automate repetitive tasks to improve efficiency and reduce human error
  • Troubleshoot and resolve complex system issues, ensuring minimal downtime
  • Document processes, systems, and configurations to ensure knowledge sharing and continuity

Benefits

  • medical
  • dental
  • vision
  • paid time off
  • 401(k)
  • life insurance
  • flexible work schedules
  • holidays
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service