Software Developer II - Data Platform

Rocket Companies - Seattle, WA

About The Position

The Data Ingestion and Infrastructure and Management team builds integrations, infrastructure pipelines, and tools to import key MLS, property, school and school-ratings, climate, and other inventory data that has made Redfin the most trusted real-estate site in the US. The team also supports our Data Platform and helps build the systems that power Redfin’s data ecosystem. In this role, you will design, develop, and operate data pipelines and services that ingest, process, and deliver large-scale datasets used by machine learning, analytics, and product teams. You will work closely with engineers across the organization to build reliable, scalable data systems using technologies such as Spark, Python/Java, Airflow, and cloud data platforms. This role is ideal for an engineer who enjoys solving complex data problems, improving platform capabilities, and delivering data systems that impact millions of users.

Requirements

  • 3–5 years of experience building software systems, data pipelines, or backend services in production environments.
  • Experience developing large-scale applications backed by relational and non-relational databases and working with distributed data processing technologies such as Spark, Kafka, or similar systems.
  • Comfort designing and implementing data pipelines and services that operate reliably at scale.
  • Ability to collaborate effectively with engineers, data scientists, and product teams to translate business needs into scalable technical solutions.
  • Deep care for data quality, system reliability, and building maintainable systems that other engineers can easily use and extend.
  • Curiosity and initiative in learning new technologies and continuously improving Redfin’s data platform and engineering practices.

Responsibilities

  • Design, build, and maintain scalable data pipelines that ingest, process, and organize large datasets (such as listings, clickstream, and external data sources) into Redfin’s data lake and analytics platforms.
  • Develop and operate distributed data processing applications using technologies such as Spark, Python/Java, and workflow orchestration tools (e.g., Airflow/Windfarm) that power machine learning, product features, and analytics.
  • Take ownership of data pipelines and platform services, ensuring reliability, performance, and data quality across Redfin’s data ecosystem.
  • Lead the development and modernization of data pipelines by migrating legacy systems to Redfin’s lakehouse architecture and standardized platform frameworks.
  • Collaborate with engineers, data scientists, and product teams to design and deliver datasets and data services that enable new product capabilities and machine learning models.

Benefits

  • Medical, dental, and vision benefits
  • 401(k) retirement plan
  • Paid time off