Babel Street is the trusted technology partner for the world’s most advanced identity intelligence and risk operations. We deliver advanced AI and data analytics solutions providing unmatched, analysis-ready data regardless of language, proactive risk identification, 360-degree insights, high-speed automation, and seamless integration into existing systems. Babel Street empower s government and commercial organizations to transform high-stakes identity and risk operations into a strategic advantage . The actionable insights we deliver safeguard lives and protect critical assets around the world . Babel Street is headquartered in Reston, Virginia , with regional offices in Boston, MA and Cleveland, OH, and international offices in Australia, Canada, Israel, Japan, and the U.K. For more information, visit www. babelstreet.com . ROLE SUMMARY As Head of Data you will define and lead Babel Street’s North Star Data Strategy and Architecture, building a cohesive, scalable, and AI-native data platform that transforms fragmented systems into a unified foundation for intelligence, analytics, and product innovation. You will own the full lifecycle of data across the organization, from ingestion and storage to semantics, retrieval, and productization. Your mandate is to unify fragmented systems into a cohesive, scalable, and AI-ready data platform that directly enables investigative, analytical, and operational workflows across Babel Street’s product suite. You will work closely with Product and Engineering leadership, to ensure that data is not just infrastructure, but a core competitive advantage. This includes powering Babel Street’s Knowledge Graph, enabling agentic and generative AI systems, and delivering data capabilities that are reliable, performant, and economically efficient at scale. This role requires deep technical expertise, strong architectural judgment, and the ability to translate complex data challenges into customer-impacting intelligence capabilities. This hybrid role will be based in our Reston, VA or Somerville, MA office. ROLE SPAN This role spans four integrated domains: 1. Data Platform & Storage Architecture You will help define and evolve Babel Street’s unified data platform, consolidating warehouse, search and object storage systems into a cohesive scalable foundation. This includes establishing clear patterns for when and how to use analytical warehouses, search/index systems, and object storage to support diverse workloads across the business. You will architect systems that operate at petabyte scale, ensuring high performance, reliability, and flexibility across batch and real-time data. A key focus will be driving platform rationalization while maintaining continuity of operations and minimizing risk. You will also establish standards for data ingestion, transformation, and lifecycle management, ensuring consistency and efficiency across the platform. 2. Data Semantics, Knowledge Graph & Identity You will own the semantic foundation of Babel Street’s data ecosystem, defining how data is modeled, connected, and understood across products and systems. This includes building and evolving the company's knowledge graph, including entity resolution, identity modeling, and relationship mapping across disparate data sources. You will establish ontology and schema strategies that ensure consistent interpretation of data across teams, products, and AI systems. Your work will enable graph-integrated reasoning and provide the structures context required for intelligence workflows and AI-driven applications. 3. Data Access, Retrieval & AI-Enablement You will design and operate data access patterns that power both human and machine consumption of data, including API’s, query layers and retrieval systems. This includes enabling hybrid retrieval approaches across structured, unstructured and vector-based data to support LLMs, RAG pipelines, and agentic systems. You will ensure that data is accessible in a way that is performant, scalable, and optimized for AI workloads. You will partner closely with AI and Applied ML teams to ensure seamless integration between data systems and model-driven capabilities, enabling reliable, explainable, and efficient intelligence generation. 4. Data Productization, Governance & Economics You will establish a data-as-a-product operating model, ensuring that data assets are discoverable, reusable, and governed with clear ownership and accountability. This includes defining contracts, enforcing quality standards, and implementing metadata and governance frameworks that scale across the organization. You will also own the economics of the data platform, ensure efficient use of storage and compute, and optimize cost per query, cost per workload, and overall system efficiency. A key focus will be enabling scalable AI usage through efficient data retrieval and storage strategies.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Executive
Number of Employees
101-250 employees