Senior Infrastructure Platform Engineer (Remote)

Teaching Strategies, LLCDenton, TX
Remote

About The Position

Teaching Strategies is an innovative edtech organization focused on connecting teachers, children, and families. As front runners in the early childhood education market, we build dynamic, top-quality digital products that integrate all of the essential elements of a high-quality solution: curriculum, assessment, professional development, and family engagement. We are building a team of results-oriented individuals who will thrive in a collaborative, work-hard/play-hard culture. We pride ourselves on the impact we have on the early childhood field through supporting teachers who are doing the most important work there is, teaching children to become creative, confident thinkers. We’re seeking a senior Infrastructure Platform Engineer to help evolve our internal developer platform into an AI-first, intelligent system that makes infrastructure self-service, adaptive, and increasingly autonomous. In this role, you will work closely with platform leadership to implement and extend a vision for abstracting cloud infrastructure (AWS, Kubernetes, and Infrastructure as Code), while introducing AI-driven workflows, natural language interfaces, and intelligent automation into the developer experience. You’ll be a hands-on technical expert in AWS EKS and platform engineering, building extensible tooling and systems that enable developers to provision, operate, and troubleshoot infrastructure through intuitive, context-aware workflows. This is a high-impact role for a senior engineer who operates with autonomy, contributes to platform direction, and executes at a high level across teams.

Requirements

  • 8+ years in infrastructure, SRE, DevOps, or platform engineering role
  • Proven experience building or contributing to internal developer platforms
  • Deep expertise in Amazon Web Services, especially EKS, IAM, and networking
  • Strong programming skills (Python or Go), with experience building production-grade tools and services
  • Strong understanding of Infrastructure as Code, GitOps, and cloud-native architectures
  • Experience or strong interest in integrating AI/LLMs into developer or operational workflows
  • Familiarity with APIs from providers like OpenAI or similar
  • Understanding of event-driven architectures and their role in intelligent automation
  • Ability to design adaptive, feedback-driven systems rather than static workflows
  • Experience building developer tooling (CLIs, portals, SDKs)
  • Familiarity with platforms like Backstage, Port, or custom developer portals
  • Strong product mindset with a focus on usability and developer experience
  • Strong systems thinking and architectural design skills
  • Ability to operate independently while aligning with platform direction
  • Effective communication and cross-team collaboration skills

Responsibilities

  • Contribute to Platform Evolution: Partner with platform leadership to evolve the internal developer platform toward an AI-first model. Translate strategic direction into practical, scalable implementations. Help shape platform capabilities through hands-on engineering and technical input.
  • Build Self-Service Infrastructure: Design and implement self-service infrastructure systems that abstract away cloud complexity. Build APIs, CLIs, and developer-facing tools that enable infrastructure provisioning without requiring deep cloud expertise. Introduce AI-assisted interfaces (CLI, chat, UI) for provisioning, debugging, and operations.
  • Engineer and Scale Kubernetes Platforms (EKS): Design, build, and operate secure, multi-tenant Amazon EKS clusters. Contribute to platform architecture, scalability, cost optimization, and reliability. Implement policy, networking, and workload isolation best practices.
  • Implement Intelligent Platform Workflows: Build systems that interpret infrastructure signals (logs, metrics, events) and provide actionable insights. Develop AI-assisted workflows for incident triage, root cause analysis, and runbook generation/execution. Enable human-in-the-loop automation for safe and controlled remediation.
  • Develop Developer Tooling and Experience: Build platform SDKs, CLIs, and service templates that standardize infrastructure consumption. Contribute to AI-powered copilots and assistants that improve developer productivity. Enable natural language interaction with platform services where appropriate.
  • Codify Best Practices: Implement reusable infrastructure patterns and opinionated defaults. Encode security, scalability, and compliance into platform tooling. Extend Infrastructure as Code practices (e.g., OpenTofu) with abstraction and validation layers.
  • Drive Observability and Insight: Implement logging, metrics, and tracing across platform systems. Leverage AI to detect anomalies, summarize system behavior, and surface actionable insights. Contribute to systems that move from visibility to guided action.
  • Iterate and Improve: Contribute to platform instrumentation and feedback loops. Use data and developer feedback to improve usability, performance, and adoption. Continuously refine platform capabilities based on real-world usage.
  • Collaborate Across Teams: Work closely with application teams, SREs, and security engineers. Support platform adoption through strong technical partnership. Provide mentorship and guidance to engineers where needed.

Benefits

  • Competitive compensation package
  • Employee Equity Appreciation Program
  • Health and wellness insurance benefits
  • 401k with employer match
  • Flexible work environment
  • Unlimited paid time off (which includes paid holidays and Winter Break)
  • Paid parental leave
  • Tuition assistance, professional development, and opportunities for career growth
  • Best in class technology equipment for every employee
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service