Senior Site Reliability Engineer
Striveworks
·
Posted:
August 17, 2023
·
Onsite
About the position
As a Senior Site Reliability Engineer at Striveworks, you will be responsible for maintaining, optimizing, and enhancing on-premises computing environments. Your role will involve leading and executing technical aspects of implementation projects, ensuring seamless integration and customization of software solutions for clients. Additionally, you will be responsible for maintaining software deployments both on-prem and in various cloud service providers, using Infrastructure-as-Code methodologies. This position offers a fully remote work environment with the option to travel to customer sites if needed.
Responsibilities
- Oversight for the automation of infrastructure-as-code for standing up virtual machines and custom Kubernetes clusters in AWS, Azure, GCP, on-premises, or hybrid cloud environments
- Internal triage of issues reported by platform users
- Working with platform developers to define requirements and build solutions for customer use cases of the platform
- Software deployments to on-prem unclassified, CUI, Secret, and Top Secret networks
- Participate in on-call rotations and incident response to swiftly address and resolve critical system issues
- Leading a small team of Site Reliability Engineers who are directly engaged with the customer, predominantly with their networking and platform management teams, to sustain the Chariot platform in air-gapped computing environments
- Contributing to the success of mission-critical systems within National Security and Commercial clients
- Wearing multiple hats and stepping into vacuums where more work is needed
- Exploring new technologies
- Working side-by-side daily with software engineers, data scientists, and end users of our products
- Holding a Top Secret U.S. security clearance
- Having 4+ years total experience as a Site Reliability Engineer, Software Engineer, or DevOps Engineer
- Having 2+ years relevant experience in developing for and/or deploying microservices in Kubernetes, programming in Python and Golang, writing and deploying Helm Charts, deploying a web-based application to a DoD/IC air-gapped network, automation and infrastructure-as-code (e.g. Terraform, Ansible), and deploying infrastructure in a cloud such as AWS, Azure, GCP, or OpenStack
- Understanding networking concepts, security best practices, and disaster recovery strategies
- Excellent communication and collaboration skills to work effectively in a cross-functional team environment
- Strong problem-solving skills and the ability to troubleshoot complex technical issues
- Deploying, maintaining, or contributing to CNCF projects (desired)
- Deploying, managing, and/or supporting enterprise information systems in a DoD environment (desired)
- Familiarity with U.S. federal information system security policies, including Security Technical Implementation Guides (STIGs), NIST 800-171, NIST 800-53, CMMC, ICD 503 (desired)
- Knowledge of DevSecOps, CI/CD pipelines, or automation (desired)
Requirements
- Top Secret U.S. security clearance
- 4+ years total experience as a Site Reliability Engineer, Software Engineer, or DevOps Engineer
- 2+ years relevant experience in:
- Developing for and/or deploying microservices in Kubernetes
- Programming in Python and Golang
- Writing and deploying Helm Charts
- Deploying a web-based application to a DoD/IC air-gapped network
- Automation and infrastructure-as-code (e.g. Terraform, Ansible)
- Deploying infrastructure in a cloud such as AWS, Azure, GCP, or OpenStack
- Understanding of networking concepts, security best practices, and disaster recovery strategies
- Excellent communication and collaboration skills to work effectively in a cross-functional team environment
- Strong problem-solving skills and the ability to troubleshoot complex technical issues
- Deploying, maintaining, or contributing to CNCF projects (preferred)
- Deploying, managing, and/or supporting enterprise information systems in a DoD environment (preferred)
- Familiarity with U.S. federal information system security policies, including Security Technical Implementation Guides (STIGs), NIST 800-171, NIST 800-53, CMMC, ICD 503 (preferred)
- Experience with DevSecOps, CI/CD pipelines, or automation (preferred)
Benefits
- Top-of-market salary and total compensation
- Generous equity plan
- Health/vision/dental insurance
- Flexible PTO
- Parental leave