Aurecon-posted 4 months ago
Senior
Manila, AR
5,001-10,000 employees
Professional, Scientific, and Technical Services

At Aurecon, we are seeking a Senior Data Engineer specializing in Generative AI to architect and build the data infrastructure that powers our next-generation AI applications. In this role, you will be at the forefront of designing scalable data pipelines, vector databases, and embedding systems that enable LLM-powered solutions for infrastructure and engineering challenges across the APAC region. You will bridge the gap between traditional data engineering and cutting-edge AI technologies, ensuring our AI agents and applications have access to high-quality, real-time data.

  • Design and implement end-to-end data pipelines for ingesting, processing, and structuring unstructured data (documents, PDFs, images) for AI consumption.
  • Build and optimise vector database systems and embedding pipelines for large-scale similarity search and RAG applications.
  • Develop robust data orchestration workflows using graph-based execution frameworks to support complex AI data requirements.
  • Implement data quality, governance, and lineage tracking systems specifically tailored for AI/ML workloads.
  • Design and build MCP-compatible data services and tools that integrate seamlessly with AI applications.
  • Collaborate with ML engineers to prepare and curate datasets for LLM evaluation, fine-tuning, and performance monitoring.
  • Establish best practices for incremental data updates, versioning, and metadata management in RAG systems.
  • Build/operate monitoring and alerting systems for data pipeline health, quality metrics, and AI model data drift.
  • Bachelor's degree in computer science, Engineering, or related field (or equivalent experience).
  • 3+ years of experience in data engineering with at least 1 year focusing on AI/ML data systems.
  • Demonstrated experience building production data systems that support AI applications.
  • Strong problem-solving abilities and self-directed learning mindset.
  • Excellent communication skills for technical and non-technical audiences.
  • Experience with Model Context Protocol (MCP) implementations and tool integration.
  • Knowledge of data preparation for LLM fine-tuning and dataset curation.
  • Familiarity with multi-modal data pipelines (text, image, video, audio).
  • Experience building developer-facing data APIs and services.
  • Flexibility - 1x every fortnight reporting in the office.
  • Wellbeing - we prioritize your health.
  • Recognition - your impact matters.
  • Family - support for modern families and carers.
  • Community - give back through volunteering days.
  • Career development - learn, lead and shape your career.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service