RE Build Manufacturing, LLC-posted about 2 months ago
Full-time • Mid Level
Remote • Framingham, MA
501-1,000 employees
Publishing Industries

The Data Engineer will focus on utilizing modern data technologies to operationalize and expand the enterprise Data Lake. This role centers on implementing efficient ingestion strategies, integrating diverse data sources, and ensuring data is structured for accessibility and analysis. The engineer will work across hybrid environments-on-prem and cloud-to automate data movement, validate data quality, and enable analytical teams to derive insights from trusted, well-organized datasets. This position requires hands-on technical depth in data ingestion and transformation, along with the analytical understanding needed to align data availability with business and reporting needs.

  • Co-design data interfaces and pipelines in close collaboration with software engineers and technical leads, ensuring alignment with application domain models and product roadmaps.
  • Build and operate batch, streaming, and change data capture (CDC) pipelines from diverse sources (ERP, CRM, Accounting, knowledge repositories, and other enterprise systems) into the data lake.
  • Model curated data within the lake into data warehouse structures (e.g., star schemas, wide tables, semantic layers) optimized for business intelligence (BI), ad-hoc analytics, and key performance indicator (KPI) reporting.
  • Publish certified datasets and policy-aware retrieval assets (tables, document embeddings, vector indexes) to enable analytics, AI, and retrieval-augmented generation (RAG) use cases.
  • Establish robust data observability and quality checks to ensure reliability and consistency.
  • Apply governance, security, and compliance controls across the data lake and warehouse - including role-based access, encryption, auditing, and data retention - in alignment with applicable regulations.
  • Operate the platform reliably by orchestrating jobs, monitoring pipelines, and continuously tuning cost and performance.
  • Work in accordance with The Re:Build Way, demonstrating collaboration, continuous improvement, and technical excellence in every aspect of data engineering.
  • 3+ years of proven experience building production-grade data systems with a strong understanding of cloud-based data lake architectures and data warehouses.
  • Demonstrated expertise in designing and operating data pipelines (batch, streaming, CDC), including schema evolution, backfills, and performance tuning.
  • Hands-on proficiency with Python and SQL, including experience with distributed processing frameworks (e.g., Apache Spark) and CI/CD for data workflows.
  • Proven ability to design and implement ETL/ELT workflows and data modeling techniques (e.g., star schemas, wide tables, semantic models).
  • Proficiency with cloud data platforms and services such as AWS, Databricks, and Snowflake, with a focus on scalability and reliability.
  • Familiarity with open table formats (e.g., Iceberg, Delta, Hudi) and business intelligence data modeling.
  • Understanding of data governance, lineage, and data quality frameworks to ensure reliability, accuracy, and compliance.
  • Experience or strong interest in enabling AI/ML use cases (e.g., RAG/search datasets, embeddings, vector indexes).
  • Bachelor's degree (BA/BS) in Computer Science, Data Science, Mathematics, Analytics, or a related quantitative field (or equivalent experience).
  • Fluency in written and spoken English.
  • Brings enthusiasm, curiosity, and a consistently positive attitude.
  • Leads by example - offering guidance, mentorship, and accountability on key technical decisions.
  • Skilled at analyzing complex technical challenges and delivering innovative, efficient solutions.
  • Flexible and adaptable to shifting priorities, requirements, and emerging technologies.
  • Communicates clearly and effectively, both in writing and verbally.
  • Exceptionally organized and thrives in a fast-paced, dynamic environment.
  • Strong analytical and problem-solving abilities with sharp attention to detail.
  • Collaborative team player who works effectively across departments and levels of the organization.
  • Must successfully complete a background check and provide reliable professional references.
  • Competitive Base Pay
  • All Re:Build Employees are eligible for performance-based bonus
  • All Re:Build Employees receive Re:Build incentive stock awards, annual
  • Competitive, Comprehensive Benefits Plan.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service