GovCIO-posted 2 months ago
Full-time • Mid Level
Onsite • Albany, NY
1,001-5,000 employees

GovCIO is currently hiring for a Data Scientist to provides technical leadership in the data science domain. This position will be located in Rome, NY and will be an onsite position. Responsibilities Develops deployable AI/ML analytics for C5ISRT data and provides technical leadership in the data science domain. Transforms raw sensor and operational data into analyst-ready insights. Develops and deploys algorithms, use cases, and models. Supports data analytics research, development, prototyping, maturation, testing, and integration, including generating large-scale labeled datasets and developing Labeled Data Campaign Plans. Applies MLOps ecosystem tools, including NVIDIA Triton Inference servers, TensorFlow, PyTorch, ONNX, and Python. Integrates models across domains with bi-weekly evaluations and dataset challenges. Operationalizes data curation, tagging, and ‘Labeling on the Line' workflows and integrates into Government-owned labeling apps and repositories. Contributes to CDRL-backed artifacts such as Detailed Analyses (A021/B021, C021). Skills: data engineering, feature development, MLOps packaging, experiment tracking, evaluation design, metric selection, and bias/error analysis with defensible reporting. Knowledge: MLOps at IL5/SIPR/JWICS, dataset curation at scale, evaluation pipelines for bi-weekly model submissions, MLOps ecosystem (e.g., Triton Inference Server, TensorFlow, PyTorch, ONNX, Python) for model execution, data security classifications, data lake management and data replication to/from edge systems, and algorithm rights/licensing limits in operations.

  • Develops deployable AI/ML analytics for C5ISRT data and provides technical leadership in the data science domain.
  • Transforms raw sensor and operational data into analyst-ready insights.
  • Develops and deploys algorithms, use cases, and models.
  • Supports data analytics research, development, prototyping, maturation, testing, and integration, including generating large-scale labeled datasets and developing Labeled Data Campaign Plans.
  • Applies MLOps ecosystem tools, including NVIDIA Triton Inference servers, TensorFlow, PyTorch, ONNX, and Python.
  • Integrates models across domains with bi-weekly evaluations and dataset challenges.
  • Operationalizes data curation, tagging, and ‘Labeling on the Line' workflows and integrates into Government-owned labeling apps and repositories.
  • Contributes to CDRL-backed artifacts such as Detailed Analyses (A021/B021, C021).
  • Skills: data engineering, feature development, MLOps packaging, experiment tracking, evaluation design, metric selection, and bias/error analysis with defensible reporting.
  • Knowledge: MLOps at IL5/SIPR/JWICS, dataset curation at scale, evaluation pipelines for bi-weekly model submissions, MLOps ecosystem (e.g., Triton Inference Server, TensorFlow, PyTorch, ONNX, Python) for model execution, data security classifications, data lake management and data replication to/from edge systems, and algorithm rights/licensing limits in operations.
  • Clearance Required:TS/SCI
  • Masters Degree
  • Working knowledge of Python Programming for ML, Cloud Computing Fundamentals, Data Curation Best Practices, Deep Learning Frameworks (TensorFlow/PyTorch), GPU Acceleration, Secure Coding Practices, C5ISRT Domain Expertise, Ethical AI/Bias Analysis
  • DoDD 8140 IAT Level II: (CompTIA Security+, or GSEC or ISC2 SSCP)
  • Bachelor's with 15+ years (or commensurate experience)
  • AWS/Azure Certified Data Engineer or ML Specialty
  • Certified Analytics Professional (CAP)
  • Employee Assistance Program (EAP)
  • Corporate Discounts
  • Learning & Development platform, to include certification preparation content
  • Training, Education and Certification Assistance
  • Referral Bonus Program
  • Internal Mobility Program
  • Pet Insurance
  • Flexible Work Environment
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service