Groundswell-posted 12 days ago
Full-time • Mid Level
Onsite • McLean, VA
251-500 employees

Who Are We? Groundswell is a premier technology integrator resolutely committed to solving the most complex challenges facing federal agencies today. Our name, Groundswell, represents our commitment to be an unstoppable, seismic change in government. Ours is a small company culture with big company reach and results . Are you ready to be audacious, be bold and drive change at a rapid pace ? Join us, where w e’ll make a greater impact together. What You'll Do: Who Are We? Groundswell is a premier technology integrator resolutely committed to solving the most complex challenges facing federal agencies today. Our name, Groundswell, represents our commitment to be an unstoppable, seismic change in government. Ours is a small company culture with big company reach and results . Are you ready to be audacious, be bold and drive change at a rapid pace ? Join us, where w e’ll make a greater impact together. What You'll Do: The Data Engineer – GenAI Pipelines will play a critical role in building, optimizing, and securing data pipelines that power AI/ML and Generative AI solutions for federal customers. This role spans data ingestion, validation, compliance, and large-scale pipeline optimization to support both enterprise analytics and LLM training/inference workflows.

  • Design, develop, and maintain ETL/ELT processes for multi-source data ingestion.
  • Seamlessly onboard and integrate new data sources into existing pipelines.
  • Build automated data validation frameworks to ensure accuracy and consistency.
  • Optimize pipelines for scalability and performance across cloud platforms.
  • Implement data quality monitoring and alerting systems.
  • Architect data solutions across AWS, Azure, and GCP.
  • Build data pipelines tailored to Large Language Model (LLM) training and inference workflows.
  • Develop preprocessing pipelines for GenAI-specific use cases.
  • Implement data privacy, security best practices, and compliance controls (e.g., FedRAMP).
  • Maintain data lineage, audit trails, and governance standards.
  • Write complex SQL queries to support data analysis and reporting.
  • Support federal proposal efforts with technical expertise on data solutions.
  • Optimize cloud resource utilization and cost management.
  • Bachelor’s degree in Computer Science, Computer Engineering, Mathematics, Statistics, or related technical field.
  • 5+ years of professional data engineering experience.
  • Strong expertise in SQL.
  • Proficiency in at least one programming language: Python, Java, or Scala.
  • Experience with modern ETL frameworks (e.g., Apache Airflow, AWS Glue).
  • Hands-on experience with at least one major cloud provider (AWS, Azure, GCP).
  • Experience implementing data quality frameworks and validation processes.
  • Must be a U.S. Citizen per contract requirements
  • Must be local to the DC Metro area and able to be onsite in McLean, VA 5 days a week
  • Comprehensive medical, dental, and vision plans
  • Flexible Spending Account
  • 4% 401K Match (immediate vesting)
  • Paid Time Off
  • Tuition reimbursement, certification programs, and professional development
  • Flexible work schedule
  • On-site gym and childcare option
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service