Principal Data Architect

Halvik, Vienna, VA

About The Position

Halvik Corp delivers a wide range of services to 13 executive agencies and 15 independent agencies. Halvik is a highly successful woman-owned business (WOB) with more than 50 prime contracts and 500+ professionals delivering Digital Services, Advanced Analytics, Artificial Intelligence/Machine Learning, Cyber Security and Cutting-Edge Technology across the US Government. Be a part of something special.

Role and Responsibilities

This position will serve as a primary data architect for the Department of Transportation (DOT) Secure Data Commons / Data Analytics and Visualization Environment (SDC/DAVE) platform. The architect will lead the design of data lakes, data warehouses, and visualization services and will work with data and cloud engineers to design pipelines that ingest, clean, and transform both batch and streaming data. The role requires a broad foundation in data architecture with the aptitude and drive to rapidly learn new tools and platforms as the program evolves. Specific responsibilities are listed in the Responsibilities section below.

Requirements

  • Must be able to obtain and maintain Public Trust security clearance
  • Bachelor's degree in Computer Science, Information Systems, or related field, or 6+ years of equivalent experience
  • Data Warehousing / Data Architecture (10+ years):
      • Designing and architecting data lakes and/or data warehouses using cloud components (AWS S3, Redshift, Aurora PostgreSQL, or equivalent Azure/GCP services)
      • Creating logical and physical data models for repositories containing heterogeneous sources
      • Optimizing data warehouses for varying workloads: high-frequency loads, large batch loads, and high-frequency queries
      • Implementing archiving, partitioning, and retention strategies to manage storage costs and compliance requirements
      • Designing strategies for slowly changing dimensions (SCD) and other data lifecycle patterns
  • Big Data / Cloud Platform Experience (2+ years):
      • Hands-on experience with Databricks or a closely comparable distributed compute platform (e.g., Azure Synapse, AWS EMR, Google Dataproc)
      • Practical experience with AWS as the primary cloud environment; familiarity with Azure and ability to extend to additional cloud providers
      • Infrastructure as Code (IaC) using Terraform for deploying and managing data pipeline components
  • Data Cataloging / Governance (2+ years):
      • Architecting a data cataloging strategy across a diverse set of sources ingested into a data lake and warehouse environment
      • Implementing a data catalog on a cloud platform
      • Familiarity with data governance concepts including data lineage, stewardship, and policy enforcement
  • System / Security Architecture (5+ years):
      • Architecting and managing security roles, access controls, and RBAC for data lakes and pipelines
      • Designing row-level and object-level security models appropriate for enterprise and federal environments

Nice To Haves

  • Databricks advanced features: Unity Catalog, Delta Live Tables (DLT), and Lakeflow/pipeline orchestration
  • Experience with Collibra or other enterprise data governance platforms (e.g., Alation, Informatica, Ataccama)
  • Familiarity with Palantir Foundry or comparable ontology-driven data platforms
  • Exposure to emerging AI/ML tooling, including LLM-based analytics, vector databases, or AI agent frameworks; willingness to rapidly learn new platforms as the program adopts them
  • Experience integrating visualization tools (Tableau, Power BI, Qlik) with data lakes and warehouses
  • Experience with data replication, high availability, and data migration tools and techniques
  • Experience designing data models for transactional applications
  • Ability to communicate effectively with senior leadership, translating complex technical concepts into actionable recommendations
  • Federal government or DoD project experience

Responsibilities

  • Leading the design of data warehouses, data lakes, and data marts, including both logical and physical models.
  • Administering and expanding the program's Databricks platform (currently hosted in AWS), including performance tuning, access control, and platform governance.
  • Supporting the onboarding and eventual administration of the Collibra data governance platform.
  • Assisting in soliciting and assessing data requirements from consumers, analysts, visualization specialists, programmers, and engineers.
  • Optimizing data warehouse design and tuning performance for efficient loading and high-throughput queries.
  • Facilitating change control processes for the data model and integrating changes with CI/CD pipelines.
  • Designing and implementing metadata and data catalog strategies to support data discovery across the enterprise.
  • Engaging with emerging AI and analytics tools being evaluated or adopted by the program, quickly developing working knowledge sufficient to inform architecture decisions.

Benefits

  • Company-supported medical, dental, vision, life, STD, and LTD insurance
  • 11 federal holidays and PTO
  • Performance-based incentives for eligible employees, in recognition of individual and/or team achievements
  • 401(k) with company matching
  • Flexible Spending Accounts for commuter, medical, and dependent care expenses
  • Tuition Assistance
  • Charitable Contribution matching