OnePay • Posted 3 months ago
Full-time • Mid Level

As a Site Reliability Engineer at OnePay, you will play a critical role in ensuring the stability, scalability, and security of the systems that power our financial products, driving reliability practices across infrastructure, platform, and application teams to support millions of customers.

  • Design, build, and maintain scalable infrastructure and tooling that improves reliability, performance, and availability across OnePay’s platform
  • Contribute to the evolution of our observability stack, platform libraries, cloud architecture, and CI/CD pipelines
  • Develop automation and monitoring systems to detect, prevent, and remediate incidents before they impact customers
  • Partner closely with product and platform engineering teams to embed reliability best practices in design, development, and deployment processes
  • Lead root cause analysis and postmortems, driving long-term improvements in resiliency and fault tolerance
  • 5+ years of experience as a Software Engineer with a focus on building and running reliable, large-scale, distributed systems in production
  • 5+ years of operational experience with observability tooling and libraries (metrics, logging, tracing), including Datadog or similar tools (Prometheus, Grafana)
  • Proficiency in at least one programming language (Python, Go, Java, or Node.js preferred) for automation and tooling
  • Proficiency in incident management, participating in on-call rotations, and writing postmortem reports
  • Excellent collaboration skills with the ability to influence and educate product engineering teams on reliability and observability best practices
  • Hands-on experience with cloud platforms (AWS preferred), container orchestration (Kubernetes), and infrastructure-as-code (IaC) tools (Terraform, Pulumi)
  • Drive and proactivity – everyone here is a builder and executor
  • Familiarity with functional programming concepts
  • Competitive base salary, stock options, and health benefits from Day 1
  • 401(k) plan with company match
  • Remote-friendly (US), flexible time off (FTO), and opportunities for growth
  • A high-growth, mission-driven, inclusive culture where your work has real impact