Senior Product Engineer, Scalability

RailwaySan Francisco, CA

About The Position

Our core mission at Railway is to make software engineers higher leverage. We believe that people should be given powerful tools so that they can spend less time setting up to do, and more time doing. Railway now powers workloads for millions of builders, and the systems underneath — usage metering, billing and payments, fraud and abuse protection, background workers, and the data pipelines that feed them all — have to scale every week. You'll be the person who architects that stack: making it fast, correct, and trustworthy at a volume that keeps growing. If you're looking to scale the backbone of an operating system for builders, we'd love to talk with you! This is a backend-leaning role focused on scaling systems. Billing and fraud will be a focus, but your remit spans every high-throughput system at Railway — workers, queues, event pipelines, and the databases underneath them. You'll own your work end-to-end, including when a feature reaches the UI.

Requirements

  • An ability to autonomously lead, design, and implement backend systems where correctness, consistency, and auditability are first-class requirements.
  • A track record of scaling systems — you've taken a pipeline, service, or database that was falling over and made it handle 10x, and you know which tools to reach for (and when polling stops being enough).
  • Deep expertise in Postgres and relational data modeling — you reach for the right consistency guarantees, understand the cost of getting them wrong, and know how Postgres itself behaves at scale.
  • Strong working knowledge of Node.js internals — the event loop, memory behavior, and what to do when a service degrades under load.
  • Experience managing complex asynchronous and long-running backend jobs, ideally with a workflow engine like Temporal, for things like billing runs or payment reconciliation.
  • Familiarity with the realities of money movement: payment providers, idempotency, retries, reconciliation, and their failure modes. Direct billing, payments, or fraud experience is a strong plus.
  • A security and abuse-aware mindset — you instinctively think about how a system can be gamed, and you design accordingly.
  • A desire to be a part of the entire project development process, from research gathering and planning, to implementation and monitoring.
  • Great written and verbal communication skills for expressing ideas, designs, and potential solutions in a mostly-asynchronous manner.

Nice To Haves

  • Rust experience, or the desire to learn it

Responsibilities

  • Architect and scale the pipelines that turn raw usage into accurate, real-time billing — metering, aggregation, rating, and invoicing across millions of events, from ingestion in ClickHouse to the rating engine.
  • Build payment flows that are correct under concurrency and partial failure: idempotent charges, retries, reconciliation, and clean handling of provider edge cases (Stripe and beyond).
  • Develop fraud and abuse detection — signal collection, real-time scoring, automated mitigation — that protects platform margin without getting in legitimate users' way.
  • Scale the systems everything else depends on: Postgres under heavy write load, Node.js services under pressure, and long-running workflows orchestrated with Temporal where exactly-once semantics and durability actually matter.
  • Build TypeScript + GraphQL APIs where correctness and auditability are non-negotiable.
  • Write Engineering Requirement Documents to take something from idea, to defined tasks, to implementation, to monitoring its success and scaling it further.
  • Contribute to our open-source repositories (CLI, Typescript SDK, Railpack, etc.) — Rust experience, or the desire to learn it, helps here.
  • Be oncall from time to time.

Benefits

  • Great salary
  • Full health benefits including dependents
  • Strong equity grants
  • Equipment stipend
  • Autonomy: Very few meetings, just a Monday and a Friday to go over the Company Board.
  • Ownership: High ownership, high autonomy culture.
  • Novel problems/solutions: Well funded startup with cool problems allowing for novel solutions.
  • Growth: Support for professional growth, whether at Railway or outside.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service