IAM IGA Data Engineer, Assistant Vice President

State StreetPrinceton, NJ
Hybrid

About The Position

State Street cyber architecture & engineering is looking for a Data Engineer to design and implement data models across relational, graph, and lakehouse systems using AWS RDS, AWS Neptune, and Databricks. This role will also contribute to AI enablement by building GraphRAG (Graph Retrieval-Augmented Generation) pipelines that power intelligent search and LLM-based applications. You’ll work closely with senior engineers and AI teams to deliver scalable, secure, and high-performance data solutions. This role can be performed in a hybrid model, where you can balance work from home and office to match your needs and role requirements.

Requirements

  • 3–5 years of experience in data engineering or similar roles.
  • Strong SQL and schema design for AWS RDS (PostgreSQL/MySQL).
  • Hands-on experience with AWS Neptune (Gremlin/SPARQL) and graph modeling.
  • Proficiency in Databricks, PySpark, and Delta Lake.
  • Familiarity with GraphRAG concepts, embeddings, and vector search.
  • Programming in Python for data pipelines and API integration.

Nice To Haves

  • Master’s or Bachelor’s Degree in Computer Science or engineering.
  • 7–10 years of experience in data analysis, data model design, and implementation to deliver scalable, enterprise-grade data solutions in a fast-paced environment.
  • Manage/mentor team members, and provide training as required.
  • Excellent analytical and interpersonal skills.
  • Excellent verbal, written communication and presentation skills.
  • Very strong practical experience in implementing large mature incident and change management processes
  • Team player, Ethical, Self-driven and self-motivated, goal and results oriented.
  • Experience in building, managing diverse geographically located teams and championing customer satisfaction.
  • Experience in support or implementation of IAM products.
  • Experience in implementing SDLC engagements projects, that must include activities such as requirements gathering, analysis, design, development, testing, deployment and application support.
  • Experience with LangChain or LlamaIndex for RAG workflows.
  • Knowledge of graph algorithms, entity linking, and semantic search.
  • Exposure to OpenSearch, FAISS, or Databricks Vector Search.
  • AWS/Databricks certifications are a plus.

Responsibilities

  • Design relational schemas in AWS RDS and graph models in AWS Neptune/Neo4J.
  • Develop warehouse/lakehouse schemas in Databricks Delta Lake for analytics and ML.
  • Implement graph-based retrieval and integrate with vector search for AI use cases.
  • Generate embeddings and manage hybrid retrieval pipelines (graph + vector).
  • Build ETL/ELT workflows using Databricks (PySpark), Airflow, or AWS Glue.
  • Ensure data quality, consistency, and freshness across systems.
  • Tune queries and optimize storage for Neptune, RDS, and Delta Lake.
  • Work with data scientists and AI engineers to support knowledge graph and RAG workflows.
  • Document data models and maintain governance standards.

Benefits

  • retirement savings plan (401K) with company match
  • insurance coverage including basic life, medical, dental, vision, long-term disability, and other optional additional coverages
  • paid-time off including vacation, sick leave, short term disability, and family care responsibilities
  • access to our Employee Assistance Program
  • incentive compensation including eligibility for annual performance-based awards
  • eligibility for certain tax advantaged savings plans
  • inclusive development opportunities
  • flexible work-life support
  • paid volunteer days
  • vibrant employee networks
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service