Data Engineer

eTelligent Group LLC, Washington, DC
Remote

About The Position

The Data Engineer is responsible for discovering, validating, transforming, and integrating IRS data sources to support API development and downstream analytics. This role ensures data accuracy, consistency, and availability across legacy and modern systems, enabling reliable API-driven data access.

Requirements

  • Bachelor’s degree in Computer Science, Data Engineering, or related field, or equivalent experience.
  • At least 3 to 5 years of experience in data engineering or backend data systems.
  • Strong skills in SQL and Python or Scala.
  • Experience building data pipelines using tools such as Spark, Airflow, Kafka, or Databricks.
  • Experience working with relational and NoSQL databases.
  • Familiarity with cloud-based data platforms.
  • Strong communication abilities.
  • Must be a US citizen.
  • Must be eligible to possess MBI (IRS Background Investigation) clearance. Active MBI clearance is preferred.

Nice To Haves

  • Experience supporting analytics, AI, or machine learning pipelines.
  • Familiarity with data governance, metadata management, or data cataloguing.
  • Experience working with federal data systems or regulated data environments.
  • Ability to contribute to growth and business development activities as needed.

Responsibilities

  • Discover and validate legacy and modern IRS data sources.
  • Design and implement ETL and ELT pipelines supporting API services.
  • Map and document data elements and object models.
  • Ensure consistency between source data, object definitions, and API outputs.
  • Support integration with modern databases, data lakes, and flat file sources.
  • Implement data quality checks and validation routines.
  • Collaborate with API developers to ensure data models support API design.
  • Support automated testing and regression validation of data-driven services.
  • Contribute to data documentation, validation logs, and performance reports.
  • Demonstrated ability to produce a high volume of accurate code.
  • Competency in OpenShift Serverless, AWS Serverless, Kubernetes, GitHub, and Jira.