Senior Data Scientist

General Dynamics Information Technology
Washington, DC
Hybrid

About The Position

GDIT is seeking Senior Data Scientists for a federal client in the Washington, D.C. area.

Work visa sponsorship will not be provided for this position.

GDIT IS YOUR PLACE

At GDIT, the mission is our purpose, and our people are at the center of everything we do.

  • Growth: AI-powered career tool that identifies career steps and learning opportunities
  • Support: An internal mobility team focused on helping you achieve your career goals
  • Rewards: Comprehensive benefits and wellness packages, 401K with company match, and competitive pay and paid time off
  • Flexibility: Full-flex work week to own your priorities at work and at home with customer approval
  • Community: Award-winning culture of innovation and a military-friendly workplace

OWN YOUR OPPORTUNITY

Explore a career in data science and engineering at GDIT and you’ll find endless opportunities to grow alongside colleagues who share your determination for solving complex data challenges.

The likely salary range for this position is $147,292 - $199,278. This is not, however, a guarantee of compensation or salary. Rather, salary will be set based on experience, geographic location and possibly contractual requirements and could fall outside of this range.

Scheduled Weekly Hours: 40
Travel Required: Less than 10%
Telecommuting Options: Hybrid
Work Location: USA DC Washington

Total Rewards at GDIT: Our benefits package for all US-based employees includes a variety of medical plan options, some with Health Savings Accounts, dental plan options, a vision plan, and a 401(k) plan offering the ability to contribute both pre- and post-tax dollars up to the IRS annual limits and receive a company match. To encourage work/life balance, GDIT offers employees full flex work weeks where possible and a variety of paid time off plans, including vacation, sick and personal time, holidays, and paid parental, military, bereavement and jury duty leave. To ensure our employees are able to protect their income, other offerings such as short- and long-term disability benefits, life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance are provided or available. We regularly review our Total Rewards package to ensure our offerings are competitive and reflect what our employees have told us they value most.

We are GDIT, a global technology and professional services company that delivers consulting, technology and mission services to every major agency across the U.S. government, defense and intelligence community. Our 30,000 experts extract the power of technology to create immediate value and deliver solutions at the edge of innovation. We operate across 50 countries worldwide, offering leading capabilities in digital modernization, AI/ML, Cloud, Cyber and application development. Together with our clients, we strive to create a safer, smarter world by harnessing the power of deep expertise and advanced technology.

Join our Talent Community to stay up to date on our career opportunities and events at gdit.com/tc.

Equal Opportunity Employer / Individuals with Disabilities / Protected Veterans

For more information about GDIT's Privacy Policy, see https://www.gdit.com/privacy-policy/notices/

Requirements

  • Deep knowledge of big data and other COTS statistical and analytical tools (R, SAS, Stata and data lake tools), database management, and ETL processes
  • Expertise in data architecture, data science tools, AI, and data lakes to facilitate successful project execution
  • Strong background in statistics and mathematics, proficiency in programming (e.g., Python, Java), experience with machine learning algorithms, and data visualization tools (e.g., Tableau, Matplotlib)
  • Comparative understanding of leading models (e.g., Claude Code, ChatGPT, xAI), including their capabilities, limitations, and trade-offs (e.g., latency, cost, fine-tuning, context window size)
  • Experience deploying and managing LLMs in FedRAMP-authorized environments, including GCC, GovCloud, or other secure cloud infrastructures
  • Candidates must be eligible to obtain a Public Trust level clearance.
  • Ability to obtain and maintain a Public Trust clearance or higher, and authorization to work in the United States.
  • US citizenship preferred.

Nice To Haves

  • Current Certified Analytics Professional (CAP) Certification
  • Current Principal Data Scientist (PDS) Certification

Responsibilities

  • Designing, building, and managing the infrastructure and tools needed to collect, store, process, and analyze large volumes of data
  • Data Collection: Gathering data from various sources, such as databases, APIs, and Internet of Things (IoT) devices
  • Data Storage: Using scalable storage solutions like data lakes and distributed file systems to handle vast amounts of data
  • Data Processing: Transforming raw data into a usable format through batch processing (e.g., Hadoop) or real-time processing (e.g., Apache Kafka)
  • Data Integration: Combining data from different sources to create a unified view
  • Data Quality: Ensuring the accuracy, consistency, and reliability of data
  • Data Security: Implementing measures to protect data from unauthorized access and breaches
  • Data Pipeline Management: Automating and orchestrating data workflows to ensure smooth data flow from source to destination, with subsequent training for any super-users once pipelines are set up
  • Storing and analyzing large datasets utilizing advanced techniques such as statistical analysis, econometrics, Machine Learning (ML), and predictive modeling with multiple scripting options such as R, Python, SAS, Stata, and SQL
  • Support of varying transfer methods (direct cloud upload, secure FTP, or physical media) from diverse sources (transactional DB, operational data stores, external SaaS, flat files, legacy mainframe)
  • Preprocessing that may include decompression, deduplication, batch-based ingestion, and near-real-time streaming

Benefits

  • Comprehensive benefits and wellness packages
  • 401K with company match
  • Paid time off
  • Full flex work weeks where possible
  • Medical plan options, some with Health Savings Accounts
  • Dental plan options
  • Vision plan
  • Paid parental, military, bereavement and jury duty leave
  • Short- and long-term disability benefits
  • Life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance