Data Engineer

OutdoorsyAustin, TX
$140,000 - $200,000

About The Position

The Outdoorsy Group is seeking an experienced Data Engineer to drive two business critical functions: the synthesis of data sources to develop a Customer 360 view across our suite of products, and achieving AI readiness by creating real-time data flows that connect to AI agents. To date, our data has been siloed across different products and tools, each with their own identifiers and structures. The ideal candidate will be able to show a track record of success building scalable data pipelines that solve Identity Resolution challenges, as well as experience building infrastructure for LLM-based applications. If you’re self-driven and looking to make an impact in a fast-paced environment, then we’d love to have you!

Requirements

  • 3+ years data engineering experience building production-grade data warehouses and pipelines. Travel, marketplace, or insurance experience is a bonus, but not essential.
  • Advanced expertise in SQL and Python, with the ability to write complex transformations and fuzzy matching algorithms, and handle messy 3rd-party data formats.
  • Direct experience with ‘Customer 360’ or ‘Golden Rule’ projects that establish source-of-truth customer profiles across disparate products and data sources.
  • Experience with streaming technologies (Kafka, Pub/Sub) or building infrastructure for LLM-based applications (Knowledge Graphs, Vector Databases, RAG).
  • You work effectively across departments, e.g. Product, Engineering, and Finance, and understand that great products come from diverse perspectives.
  • You thrive in fast-paced environments where priorities shift quickly, and you're energized by solving real customer problems.
  • Degree in a technical quantitative field like computer science, data science, and engineering, or equivalent practical experience.

Responsibilities

  • Enhance Identity Resolution such that there is a source-of-truth Customer 360 view that links users across our RV marketplace, internal insurance products, and 3rd-party insurance partners.
  • Develop fuzzy matching algorithms to triangulate single users across data that lacks common IDs.
  • Build and maintain scalable ELT/ETL pipelines (using tools like dbt, Airflow, FiveTran) that sync siloed SQL databases, external CRM data, and 3rd-party insurance and claims data into a unified warehouse (Redshift/BigQuery).
  • Architect real-time data flows to feed an AI agent, achieving AI Readiness. Through Knowledge Graphs, Vector Databases, and Retrieval-Automated Generation, ensure agents can access e.g. latest policy status, upsell propensity scores, and customer history in near real-time.
  • Own all aspects of data governance between our operational systems and the AI agent to ensure privacy, security and high-fidelity responses.

Benefits

  • Competitive Compensation: Base salary of $140k–$200k DOE.
  • Growth Opportunity: Join a company in its early stages and help build its foundation for success.
  • Equity: Opportunity to earn equity compensation.
  • Comprehensive Benefits: 100% company-paid medical premiums for employees, 401(k) [Match], and Flexible PTO.
  • Vibrant Culture: Lively Austin HQ with catered lunches, Happy Hours, and team-building events.
  • Work-Life Balance: Enjoy RV rental discounts and a company culture that values a healthy balance between work and adventure.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service