Data Engineering Intern

DomainTools
Washington, DC
Remote

About The Position

DomainTools is seeking an R&D Data Engineering Intern. This role is intended for those seeking to hone their development and data analysis skills as they begin their career. A successful candidate will be well organized, collaborative, and experienced in working with remote teams. As our intern, you will be part of a critical team supporting production machine learning pipelines, providing development and ad-hoc support to our business. Responsibilities include assisting in data hygiene projects and documentation to maintain data integrity, automating processes, and supporting the R&D team on other special projects as needed. The internship provides hands-on experience, allowing you to put your educational knowledge into action and lay a solid foundation for your future. It is an excellent learning opportunity, particularly for those interested in internet security, machine learning, and production development patterns. The right person for us will have a support-oriented mentality and will want to enable organizations to make better business decisions and improve efficiency.

Requirements

  • Strong Organizational Skills: Ability to manage your tasks and schedule effectively.
  • A Strong Attention to Detail: A sharp eye for spotting inconsistencies in data and a commitment to high-quality documentation.
  • Clear and Precise Communication Skills: The ability to share updates and collaborate effectively with a remote team.
  • Familiarity with Python and Git.
  • Ideally, familiarity with Spark/PySpark as well.

Nice To Haves

  • Knowledge of computer networks, including DNS, domain names, and IP addresses.

Responsibilities

  • Assisting in data hygiene projects and documentation to maintain data integrity.
  • Automating processes.
  • Supporting the R&D team on other special projects as needed.
  • Cleaning and preparing data to ensure our machine learning pipelines remain accurate and reliable.
  • Updating and maintaining code to ensure our system stays compatible with evolving data sources.
  • Developing and improving tools to monitor data health and help the team explore new ways to use our datasets.