Software Engineer V - Infra/SRE

Mighty Acorn Digital
Remote

About The Position

At Mighty Acorn, we build digital services that real people depend on to access benefits, fulfill obligations, and interact with their government. When those services go down or degrade, the consequences aren't just technical: they erode public trust and create real hardship for the constituents we serve. Reliability isn't a back-office concern here; it's central to our mission.

As a Software Engineer V specializing in Infrastructure and SRE, you'll lead the design and implementation of a monitoring strategy for a high-availability application in a government context. That means operating in environments that are more complex, more constrained, and higher-stakes than most commercial settings. You'll need the technical depth to build robust observability infrastructure and the communication skills to earn trust with government stakeholders, translating reliability posture and risk into terms that resonate across organizational boundaries.

This is a fully remote position. Candidates must be based in and work from the contiguous United States, with at least a 5-hour overlap with 9am–5pm ET, Monday through Friday.

This position is contingent, pending contract award.

Requirements

  • 10+ years of engineering experience, with significant time spent in SRE, platform, or infrastructure-focused roles.
  • Hands-on experience building and managing infrastructure with Terraform in AWS environments.
  • Deep familiarity with AWS observability tooling and services, including hands-on experience with ECS, RDS, and EventBridge.
  • Experience implementing and operating APM and monitoring platforms such as New Relic.
  • Ability to read, understand, and work alongside TypeScript/JavaScript application codebases — enough to instrument effectively and debug across the stack.
  • Experience operating systems that process personally identifiable information (PII), with sound judgment about the operational and security practices that entails.
  • Demonstrated experience leading a technical team in a high-trust, high-velocity environment — setting direction, maintaining standards, and developing the people around you.
  • Experience working in or alongside government agencies, with an understanding of the organizational dynamics and constraints involved.
  • Strong communication skills across technical and non-technical audiences, including the ability to translate complex reliability concepts for stakeholders without an engineering background.
  • Curiosity, patience, and resilience when navigating ambiguous or rapidly changing environments.
  • A Bachelor's degree (or equivalent experience) is contractually required for this role.
  • An ability to work efficiently, sometimes under tight deadlines.
  • A preference for transparency, and an ability to be direct in your own communication.
  • An ability to adapt quickly and cope with temporarily ambiguous situations as requirements change.
  • This role requires work be performed from within the contiguous United States.
  • Candidates must hold either active US citizenship or a green card, and should possess work authorization that does not require any present or future visa sponsorship by Mighty Acorn Digital.
  • Candidates selected for the role must pass a criminal background check prior to their start date.
  • Candidates must have a fast (>100 Mbps), reliable internet connection and a dedicated workspace where background noise is low enough for audio calls.

Nice To Haves

  • Experience with Azure monitoring tools and multi-cloud observability strategies.
  • Experience building or maintaining CI/CD pipelines using GitHub Actions.
  • Experience working in professional services or government digital services.
  • Experience building products and services for all users, regardless of ability, backed by knowledge of accessibility standards (Section 508 Refresh/WCAG 2.0 A and AA).
  • Experience or interest in sharing knowledge through mentoring, writing, or industry conferences.

Responsibilities

  • Leading the design and implementation of a comprehensive monitoring strategy for a high-availability application, balancing immediate operational needs with long-term sustainability.
  • Building and maintaining observability infrastructure using Terraform, integrating AWS-native monitoring services with New Relic to provide full-stack visibility.
  • Collaborating closely with application engineers working on TypeScript/JavaScript services running on AWS ECS, with RDS and EventBridge in the stack — understanding the application well enough to instrument it effectively.
  • Establishing reliability standards, runbooks, alerting thresholds, and incident response practices that the broader team can own and operate.
  • Leading and mentoring a technical team, setting direction, unblocking others, and coaching engineers through ambiguous and high-pressure situations.
  • Working directly with government stakeholders to communicate the reliability posture of the application, surface risk, and build confidence in the systems you're responsible for.