Data Scientist

GRVTY
McLean, VA
Onsite

About The Position

GRVTY is seeking a Data Scientist with a TS/SCI + Poly clearance (applicable to this customer) to join one of our top projects in McLean, VA. The Data Scientist will work in a fast-paced, dynamic, agile software development environment. The multi-disciplinary project team works together on multiple projects that include automating the processing of large forensic images, extracting and enriching metadata, and displaying the resulting information in meaningful ways so analysts can conduct assessments. Team members use a mix of COTS and GOTS tools and technologies and build integrations with a variety of external partner applications. Most solutions are cloud-based. The Sponsor follows Agile Scrum best practices and works in 2-week sprint cycles.

Requirements

  • TS/SCI + Poly clearance
  • Demonstrated experience building production data pipelines and ETL/ELT workflows at scale
  • Demonstrated experience with Apache Spark and PySpark for distributed data processing
  • Demonstrated advanced Python programming skills, including data manipulation libraries (Pandas, NumPy) and data engineering best practices
  • Demonstrated understanding of data security, privacy, governance, and compliance principles
  • Demonstrated experience with workflow orchestration tools such as Step Functions and Airflow
  • Demonstrated experience with containerization such as Docker or Podman, and deploying data applications in cloud environments
  • Demonstrated experience with AWS services (in particular S3, Lambda, and Step Functions)
  • Demonstrated experience with PostgreSQL and MySQL in production environments, including performance tuning and schema design
  • Demonstrated experience with SQL and query optimization for complex analytical workloads
  • Demonstrated experience with version control (Git) and CI/CD practices for data pipelines
  • Demonstrated strong problem-solving and debugging skills for data quality issues, pipeline failures, and performance bottlenecks

Responsibilities

  • Automating processing of large forensic images
  • Extracting and enriching metadata
  • Displaying resulting information in meaningful ways for analysts to conduct assessments
  • Building integrations with a variety of external partner applications
  • Working with stakeholders to understand data requirements, assess feasibility, and design appropriate solutions with minimal oversight
  • Solving data quality issues, pipeline failures, and performance bottlenecks

Benefits

  • Robust health plan including medical, dental, and vision
  • Health Savings Account with company contribution
  • Annual Paid Time Off and Paid Holidays
  • Paid Parental Leave
  • 401k with generous company match
  • Training and Development Opportunities
  • Award Programs
  • Variety of Company Sponsored Events