Software Engineer V - Infra/SRE

Mighty Acorn Digital

2d•Remote

About The Position

At Mighty Acorn, we build digital services that real people depend on — to access benefits, fulfill obligations, and interact with their government. When those services go down or degrade, the consequences aren't just technical: they erode public trust and create real hardship for the constituents we serve. Reliability isn't a back-office concern here; it's central to our mission. As a Software Engineer V specializing in Infrastructure and SRE, you'll lead the design and implementation of a monitoring strategy for a high-availability application in a government context. That means operating in environments that are more complex, more constrained, and higher-stakes than most commercial settings. You'll need the technical depth to build robust observability infrastructure and the communication skills to earn trust with government stakeholders — translating reliability posture and risk into terms that resonate across organizational boundaries. This is a fully remote position. Candidates must be based in and work from the contiguous United States, with at least a 5-hour overlap with 9am–5pm ET, Monday through Friday. This Position Is Contingent, Pending Contract Award.

Requirements

10+ years of engineering experience, with significant time spent in SRE, platform, or infrastructure-focused roles.
Hands-on experience building and managing infrastructure with Terraform in AWS environments.
Deep familiarity with AWS observability tooling and services, including hands-on experience with ECS, RDS, and EventBridge.
Experience implementing and operating APM and monitoring platforms such as New Relic.
Ability to read, understand, and work alongside TypeScript/JavaScript application codebases — enough to instrument effectively and debug across the stack.
Experience operating systems that process personally identifiable information (PII), with sound judgment about the operational and security practices that entails.
Demonstrated experience leading a technical team in a high-trust, high-velocity environment — setting direction, maintaining standards, and developing the people around you.
Experience working in or alongside government agencies, with an understanding of the organizational dynamics and constraints involved.
Strong communication skills across technical and non-technical audiences, including the ability to translate complex reliability concepts for stakeholders without an engineering background.
Curiosity, patience, and resilience when navigating ambiguous or rapidly changing environments.
A Bachelor's degree (or equivalent experience) is contractually required for this role.
An ability to work efficiently, sometimes under tight deadlines.
A preference for transparency and an ability to be direct and transparent in your own communication.
An ability to adapt quickly and cope with temporarily ambiguous situations as requirements change.
This role requires work be performed from within the contiguous United States.
Candidates must either hold active US citizenship or a green card, and should possess work authorization that does not require any present or future visa sponsorship by Mighty Acorn Digital.
Candidates selected for the role must pass a criminal background check prior to their start date.
Candidates must have a fast (>100Mbps) and reliable internet connection and have a dedicated workspace with background noise at an appropriate level for audio calls.

Nice To Haves

Experience with Azure monitoring tools and multi-cloud observability strategies.
Experience building or maintaining CI/CD pipelines using GitHub Actions.
Experience working in professional services or government digital services.
Experience building products and services for all users, regardless of ability, backed by knowledge of accessibility standards (Section 508 Refresh/WCAG 2.0 A and AA).
Experience or interest in sharing knowledge through mentoring, writing, or industry conferences.

Responsibilities

Leading the design and implementation of a comprehensive monitoring strategy for a high-availability application, balancing immediate operational needs with long-term sustainability.
Building and maintaining observability infrastructure using Terraform, integrating AWS-native monitoring services with New Relic to provide full-stack visibility.
Collaborating closely with application engineers working on TypeScript/JavaScript services running on AWS ECS, with RDS and EventBridge in the stack — understanding the application well enough to instrument it effectively.
Establishing reliability standards, runbooks, alerting thresholds, and incident response practices that the broader team can own and operate.
Leading and mentoring a technical team, setting direction, unblocking others, and coaching engineers through ambiguous and high-pressure situations.
Working directly with government stakeholders to communicate the reliability posture of the application, surface risk, and build confidence in the systems you're responsible for.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume