eBay • Posted 4 months ago
$132,000 - $222,100/Yr
Full-time • Mid Level
Austin, TX
5,001-10,000 employees

At eBay, we’re more than a global ecommerce leader: we’re changing the way the world shops and sells. Our platform empowers millions of buyers and sellers in more than 190 markets around the world. We’re committed to pushing boundaries and leaving our mark as we reinvent the future of ecommerce for enthusiasts. Our customers are our compass, authenticity thrives, bold ideas are welcome, and everyone can bring their unique selves to work, every day. We’re in this together, sustaining the future of our customers, our company, and our planet. Join a team of passionate thinkers, innovators, and dreamers, and help us connect people and build communities to create economic opportunity for all.

At eBay, data flows swiftly within our marketplace, fueling countless real-time interactions among buyers and sellers worldwide. To support this, our Rheos team provides the fully managed, continuously scaling stream processing and messaging data platform that enables near real-time buyer experiences, seller insights, and a data-driven commerce business.

We are seeking an experienced Software Engineer with deep expertise in data streaming and event-driven systems such as Apache Kafka and Apache Flink. In this role, you will develop, build, and operate the core data pipeline infrastructure that underpins eBay’s most critical applications. This position blends platform engineering (developing new features, automation, and tooling) with operational ownership, including cluster management, performance optimization, and troubleshooting. You’ll play a key role in ensuring our distributed systems are reliable, scalable, and efficient, while collaborating with a world-class team to redefine how information moves through one of the largest commerce platforms in the world.

Responsibilities
  • Build, operate, and continuously optimize eBay’s messaging and streaming platform, delivering reliability, scalability, and high performance at global scale.
  • Develop and implement new platform features and automation tools to boost system resilience and improve developer efficiency.
  • Troubleshoot and resolve complex production issues with a focus on minimizing downtime and maintaining business continuity.
  • Strengthen system monitoring, logging, and alerting to ensure proactive detection and resolution of problems.
  • Create and maintain comprehensive documentation, including system designs, operational runbooks, and best practices to support long-term platform health.

Qualifications
  • 5+ years of relevant experience along with a Master’s degree in Computer Science or related field (or equivalent experience).
  • Strong proficiency in Java and common design patterns.
  • Hands-on experience with streaming and messaging technologies such as Apache Kafka, Flink, and Pulsar.
  • Proven problem-solving skills and expertise in troubleshooting production issues.
  • Familiarity with monitoring and observability tools like Grafana, Prometheus, and ELK.
  • Experience with Kafka and Flink cluster operations is a significant plus.
  • Strong knowledge of Kubernetes and containerized environments.
  • Familiarity with databases such as Oracle, MySQL, and Redis.
  • Deep understanding of distributed system design principles, including high availability, scalability, and fault tolerance.

Benefits
  • 401(k) eligibility
  • Various paid time off benefits, such as PTO and parental leave