Data Engineer (Model Pipeline) SME

PeratonAshburn, VA
2d$112,000 - $179,000

About The Position

Peraton is seeking a Data Engineer (Model Pipeline) SME to support U.S. Customs and Border Protection (CBP) with analytics and intelligence support. This role is responsible for designing, building, and maintaining scalable data pipelines that support analytics, machine learning model pipelines, and mission-critical intelligence operations. The engineer will work with large, complex datasets to enable advanced analytics and operational decision-making across CBP mission environments. This position requires strong expertise in data engineering, ETL/ELT development, cloud platforms, and big data technologies, as well as the ability to collaborate across cross-functional teams in a federal mission environment. Support will be provided across multiple mission locations: Ashburn, VA Sterling, VA Washington, D.C.

Requirements

  • Minimum of 12 years with BS/BA; Minimum of 10 years with MS/MA. 16 years with HS diploma/equivalent can be considered in lieu of a degree.
  • 8+ years of experience in data engineering or related technical roles
  • Strong proficiency in SQL and experience with relational and NoSQL databases
  • Hands-on experience building ETL/ELT pipelines using tools such as Apache Airflow, dbt, or similar frameworks
  • Experience with cloud data platforms (AWS Redshift, Azure Synapse, Google BigQuery, or equivalent)
  • Proficiency in Python, Java, or Scala for data processing and automation
  • Experience designing scalable data architectures in distributed environments
  • Active TS/SCI clearance required
  • Ability to obtain and maintain CBP (BI) suitability
  • U.S. Citizenship required

Nice To Haves

  • Bachelor’s degree in Computer Science, Engineering, Information Systems, or related field
  • Experience with big data technologies such as Spark, Hadoop, or Kafka
  • Familiarity with containerization and orchestration tools (Docker, Kubernetes)
  • Knowledge of federal security and data governance frameworks (e.g., NIST)
  • Cloud certifications such as AWS Certified Data Analytics or Google Professional Data Engineer
  • Experience supporting DHS, CBP, or other federal agencies

Responsibilities

  • Design and maintain scalable data pipelines supporting analytics and machine learning workflows.
  • Build and optimize ETL/ELT processes for large structured and unstructured datasets.
  • Implement batch and real-time data processing pipelines to support operational and analytical use cases.
  • Develop and maintain data models and datasets that enable analytics, reporting, and machine learning.
  • Support cloud-based data platforms and modern data architectures (data lakes, warehouses, and hybrid environments).
  • Utilize distributed and streaming technologies (e.g., Spark, Kafka) for large-scale data processing.
  • Ensure data quality, governance, security, and compliance with federal data protection requirements.
  • Collaborate with engineers, analysts, and stakeholders to deliver scalable, analytics-ready data solutions.
  • Document pipeline architecture and contribute to Agile development and CI/CD practices.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service