Senior Staff Engineer - Event Platform Storage

Datadog | New York, NY
14d | Hybrid

About The Position

We are looking for a Staff Engineer to join our Event Platform Storage team. Our Event Platform ingests, transforms, and stores events at a rate of ~15 million messages/second and provides that data to more than 30 Datadog products through a query API. With a focus on a high level of reliability, you will contribute to a platform built with Java, Go, and Rust. Engineers with a background or interest in the challenges of optimizing distributed systems for durability, high availability, low latency, and scalability are encouraged to apply.

Husky Blog, Husky Deep Dive

This is a unique opportunity to contribute to one of the most critical platforms at Datadog. The platform provides a scalable, exactly-once, cost-effective, and reliable storage engine for timestamped payloads, together with a full-featured, fast query engine and multiple integration options. We face challenges in continuing to scale these components and in adding new features that matter to our customers. This is an opportunity to accelerate the growth of one of the most profitable parts of the business.

At Datadog, we place value in our office culture: the relationships and collaboration it builds and the creativity it brings to the table. We operate as a hybrid workplace to ensure our Datadogs can create a work-life harmony that best fits them.

Requirements

  • You have led cross-team initiatives in a platform or infrastructure-focused environment for 2+ years
  • You are passionate about performance and efficiency optimization
  • You have led impactful technical initiatives in an environment where performance, reliability, and accuracy are first-order concerns
  • You have a reliability-oriented mindset and care deeply about designing and building resilient architectures
  • You have significant backend programming experience and have architected, built, and operated distributed systems to solve problems at high scale
  • You’re excited about leveraging AI tools to enhance how you code, solve problems, and build – or eager to learn how

Responsibilities

  • Design and drive high priority, high visibility projects that increase the platform's resilience and scalability across multiple teams
  • Lead and guide others through architectural decisions for new and existing distributed, high-throughput, real-time systems
  • Identify potential system risks and trends in reliability, and design solutions to address them
  • Provide input on prioritization of engineering-led initiatives in short- and long-term planning and roadmaps
  • Collaborate closely with partner platforms that integrate and depend on the event platform to provide critical capabilities to their customers

Benefits

  • Build tools for software engineers just like yourself
  • Use the tools we build to accelerate our own development
  • Have a lot of influence on product direction and impact on the business
  • Work with skilled, knowledgeable, and kind teammates who are happy to teach and learn
  • Competitive global benefits
  • Continuous professional development
  • Benefits and Growth listed above may vary based on the country of your employment and the nature of your employment with Datadog.

What This Job Offers

  • Job Type: Full-time
  • Career Level: Mid Level
  • Education Level: No Education Listed
  • Number of Employees: 1,001-5,000 employees
