Data Product Developer

Booz Allen HamiltonReston, VA
1d$62,000 - $141,000

About The Position

Data Product Developer The Opportunity: As a Data Product Developer on the Databricks platform at Booz Allen, you'll turn complex mission data into trusted, consumable data products that empower analysts, decision-makers, and AI/ML teams across defense, intelligence, and civil domains. You'll focus on designing, building, and iterating on reusable data assets — from governed medallion architectures to self-service analytics layers — using Databricks as the core Lakehouse platform. Join a team that accelerates national security outcomes by delivering high-quality, discoverable, and reliable data products at scale. Help clients achieve data-driven superiority through modern engineering practices on Delta Lake, Unity Catalog, and Databricks Workflows. Due to the nature of work performed within this facility, U.S. citizenship is required. Work with us to use data for good. What You'll Work On:

Requirements

  • 2+ years of experience in data engineering, software development, or analytics engineering
  • Experience writing SQL queries for data analysis and transformation
  • Experience with Python programming such as scripting and data manipulation
  • Knowledge of distributed data processing concepts such as Spark fundamentals, including partitioning, caching, and shuffle
  • Ability to work in a team environment and communicate technical concepts to non-technical stakeholders
  • Ability to obtain and maintain a Public Trust or Suitability/Fitness determination based on client requirements
  • Bachelor’s degree in Computer Science, Engineering, Information Systems, or Mathematics

Nice To Haves

  • Experience with Databricks, including notebooks, Delta Lake, Workflows, or DLT in academic, internship, personal project, or professional environments
  • Experience with cloud platforms such as Azure, AWS, or GCP and object storage
  • Experience with orchestration tools such as Databricks Workflows or Airflow or dbt
  • Experience with streaming data processing or real-time pipelines
  • Knowledge of Git for version control and collaborative development
  • Knowledge of data governance concepts, including Unity Catalog and access controls
  • Databricks Certified Data Engineer Associate Certification

Responsibilities

  • Design and develop data products such as curated datasets, feature stores, analytics-ready tables, and governed views in Databricks bronze, silver, and gold layers
  • Build end-to-end pipelines with Delta Live Tables (DLT), PySpark, Spark SQL, and Delta Lake to create reliable, incremental, and quality-assured data assets
  • Implement Unity Catalog governance features such as, tags, access controls, and lineage tracking to ensure data products are secure, discoverable, and compliant
  • Collaborate with mission stakeholders, data analysts, data scientists, and product owners to define requirements, iterate on data product features, and align with business and mission value
  • Apply medallion architecture patterns and data quality and validation frameworks to produce trustworthy, production-grade data products
  • Monitor usage, performance, and cost of data products, optimizing with Photon engine and serverless compute where appropriate
  • Contribute to documentation, demos, and knowledge sharing so clients can self-serve and extend data products
  • Learn and incorporate emerging Databricks capabilities such as Lakeflow and AI-powered tools to evolve data products faster

Benefits

  • health, life, disability, financial, and retirement benefits, as well as paid leave, professional development, tuition assistance, work-life programs, and dependent care.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service