InStride • Posted 3 months ago
$165,000 - $185,000/Yr
Full-time • Senior
Los Angeles, CA
101-250 employees

At InStride, people are our purpose. We believe that investing in people is the most powerful way to drive success—for individuals and organizations alike. As a public benefit corporation, we partner with leading employers to unlock opportunities for their employees, providing access to top-tier education programs that align with their employees’ career goals and the company’s business goals. Our mission goes beyond skill-building; we're here to empower our partners’ employees to advance their careers, elevate their expertise, and achieve meaningful personal and professional growth. No matter the team you’re on, our dedication to the success of our partners and their employees is what drives us. If you're passionate about making a difference and driving educational and professional advancement, InStride is the place for you.

Responsibilities:
  • Design and operate multi-region, fault-tolerant systems that ensure InStride’s learning platform is always available for learners and partners.
  • Deliver Infrastructure as Code libraries, CI/CD pipelines, and self-service capabilities that reduce operational toil and accelerate developer productivity.
  • Implement defense-in-depth strategies, policy-as-code guardrails, and proactive monitoring to protect sensitive data and maintain trust.
  • Define and enforce SLIs/SLOs, establish error-budget policies, and build monitoring frameworks that inform release readiness and operational decisions.
  • Deploy and manage service mesh solutions that secure, monitor, and optimize service-to-service communication across Kubernetes workloads.
  • Partner with engineering and security stakeholders to shape InStride’s AWS strategy, ensuring scalability, resilience, and cost efficiency.
  • Share expertise, lead design reviews, and guide teams toward modern DevOps and SRE practices, raising the technical bar across the organization.
Qualifications:
  • 10+ years of experience in SRE, DevOps, or Platform Engineering roles operating production AWS workloads.
  • Hands-on expertise with AWS EKS, Kubernetes networking, Helm, autoscaling frameworks (Karpenter/Cluster Autoscaler), serverless architectures, and API Gateways.
  • Proven delivery of service mesh solutions (Istio, Linkerd, or AWS App Mesh) for secure and observable service-to-service communication.
  • Proficiency with Infrastructure as Code (IaC) using AWS CDK (TypeScript preferred, or Python), Terraform, or CloudFormation.
  • Strong programming and automation skills in Go, Python, or TypeScript, with additional proficiency in Bash.
  • Demonstrated experience implementing policy-as-code with OPA/Rego or similar tooling integrated into CI/CD pipelines.
  • Solid understanding of SLI/SLO/error-budget methodologies and hands-on experience with monitoring and alerting stacks (Prometheus, Grafana, CloudWatch, Groundcover).
  • Deep knowledge of AWS security best practices, including IAM policies, encryption, OS hardening, and compliance enforcement.
  • Excellent communication skills with the ability to translate reliability metrics into business impact and guide incident/post-mortem discussions.
  • Experience mentoring engineers and influencing enterprise AWS and DevOps strategies without direct management responsibilities.
  • Familiarity with Internal Developer Portals (Backstage, Port, Cortex) and self-service automation is a strong plus.
Benefits:
  • 401(k) plan with company match
  • Flexible vacation policy
  • Paid family leave
  • Best-in-class health care benefits
  • Tuition coverage for 2,800+ online certificate and degree programs through the Step Forward program.