Senior DevOps Engineer (Active Secret Clearance)

Striveworks
Austin, TX
Hybrid

About The Position

Striveworks helps organizations use artificial intelligence to solve national security and business challenges by acting as a command center for data, models, and business outcomes. Founded by data scientists and engineers, Striveworks aims to simplify the journey from AI deployment to ongoing optimization, enabling organizations to build reliable, adaptable, and scalable AI systems.

The Senior DevOps Engineer will take ownership of specific product deployments: maintaining, optimizing, and enhancing on-premises and cloud computing environments. This role is crucial to delivering software solutions to clients. It covers the technical side of implementation projects, including integration, customization, and configuration of software. The engineer's expertise will be vital for deploying new instances of Striveworks’ AI operations (AIOps) capabilities to customer infrastructure, particularly for national security and commercial clients.

The engineer works on the DevOps team to monitor, automate, and improve software reliability, performance, and availability across various projects, and acts as a liaison between platform developers and customer-facing teams. The role requires a proactive individual who can step into areas needing improvement and explore new technologies.

Requirements

  • 6+ years of direct, hands-on experience programming in Python and/or Golang, or other general-purpose programming languages
  • 6+ years of direct, hands-on experience deploying microservices in Kubernetes
  • 6+ years of direct, hands-on experience diagnosing and resolving issues within containerized environments
  • 6+ years of direct, hands-on experience developing and deploying Helm charts and Kustomizations
  • 6+ years of direct, hands-on experience with automation and IaC (e.g., Terraform, Ansible)
  • 6+ years of direct, hands-on experience with cloud infrastructure (e.g., AWS, Azure, GCP, or OpenStack)
  • 6+ years of direct, hands-on experience managing and troubleshooting Linux systems (e.g., RHEL, Ubuntu, CentOS)
  • The ability to work cross-functionally to define requirements and build solutions for customer use cases of the platform
  • The ability to respond professionally and competently to incident reports and triage critical system faults
  • Active Secret (or above) US security clearance
  • US citizenship

Nice To Haves

  • Experience with US federal information system security policies, including Security Technical Implementation Guides (STIGs), NIST 800-171, NIST 800-53, CMMC, and ICD 503
  • Experience with software deployments to on-premises and cloud-based unclassified, CUI, and classified networks within the DOD
  • Experience with DevSecOps/DevOps and CI/CD for the administration and deployment of GPU-enabled servers
  • Experience deploying or maintaining Cloud Native Computing Foundation (CNCF) projects
  • Experience with network-attached storage (NAS) and storage area network (SAN) technologies
  • Experience with Kubernetes and cloud-native applications and services in denied, disrupted, intermittent, and limited (DDIL) environments

Responsibilities

  • Automating Infrastructure-as-Code (IaC) to manage virtual machines and deploy containers, services, and other infrastructure
  • Deploying custom Kubernetes clusters in AWS, Azure, GCP, on-premises, or hybrid cloud environments
  • Working with platform developers, other DevOps teammates, and customer-facing teams to define requirements and build solutions for customer use cases of the platform
  • Executing software deployments to commercial and, later, unclassified, CUI, and classified Department of Defense (DOD) networks
  • Performing incident response and initial triage of critical system faults
  • Monitoring, automating, and improving software reliability, performance, and availability for various projects
  • Acting as a liaison between platform developers and customer-facing teams, taking on operational tasks to ensure the efficient functioning of Striveworks’ solutions
  • Working alongside a team of software engineers and data scientists to help them deploy and operate their work as functional products
  • Providing guidance and leadership to junior DevOps team members (may be required)
  • Directly contributing to the success of mission-critical systems for national security and commercial clients
  • Wearing multiple hats and stepping into vacuums where improvements are needed
  • Exploring new technologies and solutions

Benefits

  • Medical/dental/vision insurance
  • Voluntary life, long-term disability, accident, and hospital indemnity insurance
  • HSA and FSA (including dependent care FSA) plans
  • 401(k) plan
  • Unlimited PTO
  • Paid parental leave