Senior Software Engineer - Cloud Services

The Trade Desk
San Francisco, CA
Onsite

About The Position

The Trade Desk is a global technology company and the world’s leading independent platform for digital advertising. Our technology helps advertisers reach the right audiences across the open internet. Advertising powers the content people love. By making it more transparent, effective, and responsible, we help support trusted journalism, quality entertainment, and creators worldwide. The world’s brands and agencies rely on us to reach their customers and grow their businesses responsibly.

The scale of our platform brings unique technical challenges, from processing massive datasets in real time to building systems that operate reliably on a global scale. When you work here, your impact is worldwide. We welcome diverse perspectives, encourage curiosity, and build teams that learn from one another. If you’re driven to solve meaningful challenges, we’d love to meet you.

The Trade Desk's approach to infrastructure is in the midst of an exciting transformation toward service-oriented architecture (SOA), and we need your help! Up until now we've grown to massive scale via a centralized Site Reliability Engineering (SRE) team. As embracers of change, we've decided that the best route forward is through smaller, focused teams. Now that we've decentralized, we're working on a self-service, Infrastructure-as-a-Service transformation. We have opportunities working with Kubernetes, Kafka, multiple cloud providers, and an ever-expanding physical server footprint. Transformations are challenging, but they provide multiple opportunities for a positive impact on a growing company. If this sounds like an exciting pursuit, we'd love to talk to you!

ABOUT THE ROLE: As a Senior Software Engineer with a specialization in Infrastructure you will:

  • Create and maintain in-house service-oriented solutions at scale for the infrastructure required to run a globally distributed system handling over 15 million requests per second.
  • Help product teams ship more efficiently and safely through automation, tools, and processes which can be used by all teams at The Trade Desk.
  • Ensure supportability by innovating solutions for our infrastructure through building, implementing, operating, and adding features to self-service tooling and automation.
  • Participate in root-cause analysis and postmortem discussions to effectively drive long-term operational health improvements.
  • Analyze for process gaps and implement solutions to speed up execution and reduce manual toil.
  • Participate in a 24/7 on-call rotation.

While this is not strictly a Site Reliability Engineer (SRE) role, elements of that mindset apply: configuration management, capacity modeling, monitoring, data collection and analysis, key performance indicator definitions, and tracking.

Requirements

  • Experience writing clean, maintainable, and well-tested code in any of the following languages: TypeScript, Go/Golang, or C#
  • Designing, developing, deploying, and supporting service-oriented applications
  • Domain knowledge in one or more of the following: Kubernetes, Docker, ArgoCD, Backstage
  • Domain knowledge in one or more of the following: Kafka, Service Discovery (e.g., Consul)
  • Domain knowledge in one or more of the following: AWS, Azure, Alibaba Cloud (Aliyun)
  • Domain knowledge in one or more of the following: Linux operating system internals, filesystems, storage technologies, protocols, and networking stack
  • Domain knowledge in one or more of the following: GitOps tools such as Terraform, Ansible, or CloudFormation
  • An understanding of systems design as well as the advantages or drawbacks to various approaches.
  • Experience building always-on systems, working across a variety of technologies and service layers.
  • A data-driven approach to both daily time investments and long-term bets.
  • Operating in a way that reduces complexity and cuts operational risks, with a solid grasp of costs and the return on investment in terms of time, implementation, and customer impact.
  • A track record of making significant, self-directed contributions to large and impactful projects.
  • Actively communicate with your team as well as across the organization to effectively drive toward a unified goal.
  • Practical problem solving, superb communication and documentation skills.
  • Thinking beyond the task at hand to deeply understand the 'why' behind an objective.
  • Openness to, and understanding of, ideas and perspectives that differ from your own, and an interest in finding and building on common ground.
  • You are a creative thinker, not bound by "the way things have always been done", who thinks of the questions nobody has yet asked. What you know is less important than how well you learn, innovate, collaborate, and adapt.
  • As a global team from many diverse backgrounds, experiences, and perspectives, you value and seek out paths for fostering diversity.

Responsibilities

  • Create and maintain in-house service-oriented solutions at scale for the infrastructure required to run a globally distributed system handling over 15 million requests per second.
  • Help product teams ship more efficiently and safely through automation, tools, and processes which can be used by all teams at The Trade Desk.
  • Ensure supportability by innovating solutions for our infrastructure through building, implementing, operating, and adding features to self-service tooling and automation.
  • Participate in root-cause analysis and postmortem discussions to effectively drive long-term operational health improvements.
  • Analyze for process gaps and implement solutions to speed up execution and reduce manual toil.
  • Participate in a 24/7 on-call rotation.

Benefits

  • comprehensive healthcare (medical, dental, and vision) with premiums paid in full for employees and dependents
  • retirement benefits such as a 401k plan and company match
  • short and long-term disability coverage
  • basic life insurance
  • well-being benefits
  • reimbursement for certain tuition expenses
  • parental leave
  • sick time of 1 hour per 30 hours worked
  • vacation time for full-time employees of up to 120 hours through the first year and 160 hours thereafter
  • around 13 paid holidays per year
  • Employees can also purchase The Trade Desk stock at a discount through The Trade Desk’s Employee Stock Purchase Plan.