Data Scientist 3

GormatAnnapolis Junction, MD
20h

About The Position

We are seeking a Data Scientist with a strong background in automation for reporting purposes. You will complete data transformation and modeling using SQL and Python (Pandas, PySpark, or Polars is preferred) and design scalable ETL workflows, implementing incremental and batch data ingestion with robust data validation. You will also preprocess, minimize, filter, normalize, aggregate, and reshape datasets for reporting and maintain Python automation packages to be utilized by DOD personnel. The Level 3 Data Scientist shall possess the following capabilities: Foundations: (Mathematical, Computational, Statistical). Data Processing: (Data management and curation, data description and visualization, workflow and reproducibility). Modeling, Inference, and Prediction: (Data modeling and assessment, domain-specific considerations). Ability to make and communicate principal conclusions from data using elements of mathematics, statistics, computer science, and applications-specific knowledge. Ability to use analytic modeling, statistical analysis, programming, and/or another appropriate scientific method, develop and implement qualitative and quantitative methods for characterizing, exploring, and assessing large datasets in various states of organization, cleanliness, and structure that account for the unique feature and limitations inherent in Government data holdings. Translate practical mission needs and analytic questions related to large datasets into technical requirements and, conversely, assist others with drawing appropriate conclusions from the analysis of such data. Effectively communicate complex technical information to non-technical audiences. Primary success marker is experience with process automation for reporting. Includes designing scalable extract transform load (ETL) workflows, implementing incremental and batch data ingestion with robust data validation. Candidate will complete data transformation & modeling using SQL and Python (Pandas, PySpark, or Polars is preferred). Requires preprocessing, minimizing, filtering, normalizing, aggregating, and reshaping datasets for reporting. Automating dashboards using Power BI, Tableau etc is a plus. Candidate will maintain python automation packages to be utilized by agency personnel.

Requirements

  • Bachelor's Degree with 10 years of relevant experience, associate's degree with 12 years of experience may be considered for individuals with in-depth experience that is clearly related to the position.
  • Bachelor's Degree must be in Mathematics, Applied Mathematics Statistics, Applied Statistics, Machine learning, Data Science, Operations Research, or Computer Science or a degree in a related field (Computer Information Systems, Engineering), a degree in the physical/hard sciences (e.g. physics, chemistry, biology, astronomy), or other science disciplines with a substantial computational component (i.e. behavioral, social, or life) may be considered if it included a concentration of coursework (5 or more courses) in advanced Mathematics (typically 300 level or higher, such as linear algebra, probability and statistics, machine learning) and/or computer science (e.g. algorithms, programming, , data structures, data mining, artificial intelligence).
  • Broader range of degrees will be considered if accompanied by a Certificate in Data Science from an accredited college/university.
  • Relevant experience must be in designing/implementing machine learning, data science, advanced analytical algorithms, programming (skill in at least on high level language (e.g. Python), statistical analysis (e.g. variability, sampling error, inference, hypothesis testing, EDA, application of linear models), data management (e.g. data cleaning and transformation), data mining, data modeling and assessment, artificial intelligence, and/or software engineering.
  • Experience designing ETL workflows.
  • Experience with SQL and Python (Pandas, PySpark, or Polars preferred).
  • TS/SCI with polygraph is required.

Nice To Haves

  • Experience automating dashboards using PowerBI or Tableau is a plus.

Responsibilities

  • Complete data transformation and modeling using SQL and Python (Pandas, PySpark, or Polars is preferred)
  • Design scalable ETL workflows, implementing incremental and batch data ingestion with robust data validation
  • Preprocess, minimize, filter, normalize, aggregate, and reshape datasets for reporting
  • Maintain Python automation packages to be utilized by DOD personnel
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service