Senior DevOps Engineer (Active Secret Clearance)

StriveworksAustin, TX
Hybrid

About The Position

Striveworks helps organizations harness the power of artificial intelligence to solve real-world national security and business challenges by serving as the command center between data, models, and business outcomes. Founded by data scientists and engineers, Striveworks set out to make the journey from deployment to ongoing optimization simple and effective. With Striveworks, organizations aren’t just deploying AI—they’re building systems that remain reliable, adaptable, and ready to scale in an unpredictable world. Mission-critical operations require models that perform where they’re deployed, scale as workloads grow, and adapt rapidly as AI capabilities advance. Striveworks meets these demands, increasing reliability and performance while lowering costs—and enabling confident, data-driven decision-making in dynamic environments. As a Senior DevOps Engineer at Striveworks, you will be challenged—and trusted—on day one to take ownership of specific product deployments by maintaining, optimizing, and enhancing our on-premises and cloud computing environments. You will play a crucial role in the successful deployment of our software solutions to clients. You will be responsible for executing technical aspects of implementation projects and for ensuring the seamless integration, customization, and configuration of our software. Your expertise will play a critical role for the company as we deploy new instances of Striveworks’ AI operations (AIOps) capabilities to customer infrastructure. You are right for this opportunity if you value and possess technical expertise and you enjoy pushing the boundaries of your capabilities. You will be responsible for maintaining Striveworks’ software deployments using Infrastructure-as-Code (IaC) methodologies. The Senior DevOps Engineer works on the DevOps team. You will be responsible for monitoring, automating, and improving software reliability, performance, and availability for various projects. You will also act as a liaison between platform developers and customer-facing teams, taking on operational tasks to ensure the efficient functioning of Striveworks’ solutions. You will work alongside a team of software engineers and data scientists to help them deploy and operate their work as functional products, learning from them so that building effective AI solutions becomes second nature. You may provide guidance and leadership to junior DevOps team members. You will directly contribute to the success of mission-critical systems within national security and commercial clients. You will be expected to wear multiple hats and to step into vacuums where improvements are needed, and you will be given the breadth to explore new technologies and solutions. This position offers a fully remote work environment, or you can work hybrid/on site at our office in northwest Austin, TX. You will be expected to travel up to 20% of the time.

Requirements

  • 6+ years of direct, hands-on experience in Python and/or Golang programming, or other general purpose programming languages
  • 6+ years of direct, hands-on experience in Microservice deployment in Kubernetes
  • 6+ years of direct, hands-on experience in Diagnosing and resolving issues within containerized environments
  • 6+ years of direct, hands-on experience in Helm Chart and Kustomizations development/deployment
  • 6+ years of direct, hands-on experience in Automation and IaC (e.g., Terraform, Ansible)
  • 6+ years of direct, hands-on experience in Cloud infrastructure (e.g., AWS, Azure, GCP, or OpenStack)
  • 6+ years of direct, hands-on experience in Managing and troubleshooting Linux systems (e.g., RHEL, Ubuntu, CentOS)
  • The ability to work cross functionally to define requirements and build solutions for customer use cases of the platform
  • The ability to respond professionally and competently to incident reports and triage critical system faults
  • Active Secret (or above) US security clearance
  • US citizenship

Nice To Haves

  • Experience with US federal information system security policies, including Security Technical Implementation Guides (STIGs), NIST 800-171, NIST 800-53, CMMC, and ICD 503
  • Experience with software deployments to on-premises and cloud-based unclassified, CUI, and classified networks within the DOD
  • Experience with DevSecOps/DevOps and CI/CD for the administration and deployment of GPU-enabled servers
  • Experience deploying or maintaining Cloud Native Computing Foundation (CNCF) projects
  • Experience with network-attached storage (NAS) and storage area network (SAN) technologies
  • Experience with Kubernetes and cloud-native applications and services in denied, disrupted, intermittent, and limited impact (DDIL) environments

Responsibilities

  • Automating IaC to manage virtual machines and deploy containers, services, and other infrastructure; leaning on expertise to deploy custom Kubernetes clusters in AWS, Azure, GCP, on-premises, or hybrid cloud environments
  • Working with platform developers, other DevOps teammates, and customer-facing teams to define requirements and build solutions for customer use cases of the platform
  • Software deployments to commercial and, later, unclassified, CUI, and classified Department of Defense (DOD) networks
  • Incident response and initial triage of critical system faults
  • Monitoring, automating, and improving software reliability, performance, and availability for various projects
  • Acting as a liaison between platform developers and customer-facing teams, taking on operational tasks to ensure the efficient functioning of Striveworks’ solutions
  • Working alongside a team of software engineers and data scientists to help them deploy and operate their work as functional products
  • Providing guidance and leadership to junior DevOps team members
  • Taking ownership of specific product deployments by maintaining, optimizing, and enhancing on-premises and cloud computing environments
  • Executing technical aspects of implementation projects and ensuring the seamless integration, customization, and configuration of software
  • Maintaining Striveworks’ software deployments using Infrastructure-as-Code (IaC) methodologies
  • Directly contributing to the success of mission-critical systems within national security and commercial clients
  • Stepping into vacuums where improvements are needed and exploring new technologies and solutions

Benefits

  • Medical/dental/vision insurance
  • Voluntary life, long-term disability, accident, and hospital indemnity insurance
  • HSA and FSA (including dependent care FSA) plans
  • 401(k) plan
  • Unlimited PTO
  • Paid parental leave
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service