The Trade Desk is a global technology company and the world’s leading independent platform for digital advertising. Our technology helps advertisers reach the right audiences across the open internet. Advertising powers the content people love. By making it more transparent, effective, and responsible, we help support trusted journalism, quality entertainment, and creators worldwide. The world’s brands and agencies rely on us to reach their customers and grow their businesses responsibly.

The scale of our platform brings unique technical challenges, from processing massive datasets in real time to building systems that operate reliably across the globe. When you work here, your impact is worldwide. We welcome diverse perspectives, encourage curiosity, and build teams that learn from one another. If you’re driven to solve meaningful challenges, we’d love to meet you.

The Trade Desk's approach to infrastructure is in the midst of an exciting transformation toward a service-oriented architecture (SOA), and we need your help! Until now, we've grown to massive scale with a centralized Site Reliability Engineering (SRE) team. As embracers of change, we've decided that the best route forward is through smaller, focused teams. Now that we've decentralized, we're working on a self-service, Infrastructure-as-a-Service transformation. We have opportunities to work with Kubernetes, Kafka, multiple cloud providers, and an ever-expanding physical server footprint. Transformations are challenging, but they provide many opportunities to make a positive impact on a growing company. If this sounds like an exciting pursuit, we'd love to talk to you!
ABOUT THE ROLE:
As a Senior Software Engineer with a specialization in Infrastructure, you will:
Create and maintain in-house, service-oriented solutions at scale for the infrastructure required to run a globally distributed system handling over 15 million requests per second.
Help product teams ship more efficiently and safely through automation, tools, and processes that can be used by all teams at The Trade Desk.
Ensure supportability by building, implementing, operating, and adding features to self-service tooling and automation for our infrastructure.
Participate in root-cause analysis and postmortem discussions to drive long-term improvements in operational health.
Identify process gaps and implement solutions that speed up execution and reduce manual toil.
Participate in a 24/7 on-call rotation.
While this is not strictly a Site Reliability Engineer (SRE) role, elements of the SRE mindset apply: configuration management, capacity modeling, monitoring, data collection and analysis, and the definition and tracking of key performance indicators.
Job Type
Full-time
Career Level
Senior
Education Level
No Education Listed