Data Engineer, Lead

Booz Allen HamiltonUsa, DC
18h$99,000 - $225,000

About The Position

The Opportunity: Ever-expanding technology like IoT, machine learning, and artifi cia l intelligence means that there’s more structured and unstructured data available today than ever before. As a data engineer, you know that organizing data can yield pivotal insights when it’s gathered from disparate sources. We need an experienced data professional like you to help our clients find answers in their data to impact important missions—from fraud detection to cancer research to national intelligence. As a Data Engineer at Booz Allen, you’ll use your expertise to help build advanced technology solutions and lead data engineering activities on some of the most mission-driven projects in the industry. You’ll guide data engineering activities by overseeing the development and deployment of pipelines and platforms that organize and make disparate data meaningful. Here, you’ll mentor a multi-disciplinary team of analysts, data engineers, developers, and data consumers in a fast-paced, agile environment. You’ll use your expertise in analytical exploration and data examination while you oversee the assessment, design, building, and maintenance of scalable platforms for your clients. Work with us to use data for good. Here, we focus on growing as a team to make the best solutions for our customers, so you’ll have resources for mentoring and learning new skills and tools. Join our team as we transform healthcare capabilities with cloud technology. What You’ll Work On: Design a cloud-based data solution to support the healthcare data repository. Recommend standards, tools, and capabilities based on your research of the environment and knowledge. Explore how to help customers overcome their most difficult challenges in cloud data storage. Work with the client’s technical leads to implement strategy and architecture design. Lead the planning, designing, and deployment of hybrid on-premises and cloud architecture solutions for use in a next-gen platform enabling data science and advanced analytics applications leveraging artifi cia l intelligence and machine learning, automation, cloud-based security, and big data. Join us. The world can’t wait.

Requirements

  • 10+ years of experience with programming languages such as Python, Scala, or Java
  • Experience with data validation, cleansing, and enrichment techniques to improve the accuracy and completeness of data
  • Experience building and optimizing data pipelines to extract data from various sources, transforming it into the required format, and loading using Databricks and AWS services
  • Experience designing and developing ETL workflows using tools such as Apache Spark or AWS Glue, along with monitoring and troubleshooting ETL processes to identify and resolve issues in a timely manner
  • Experience with data storage technologies and databases such as Amazon S3 or Amazon Redshift
  • Experience with Center s for Medicare & Medicaid Services (CMS) programs
  • Knowledge of ETL best practices, data integration techniques, and data quality management
  • Knowledge of Cloud database technology and cloud-native data architecture
  • Ability to obtain and maintain a Public Trust or Suitability/Fitness determination based on client requirements
  • Bachelor’s degree

Nice To Haves

  • Experience with multiple operating systems and languages and planning and implementing large-sized databases
  • Experience with UNIX or Linux, including basic commands and Shell scripting
  • Experience with a public cloud, including AWS, Micro sof t Azure, or Google Cloud
  • Experience with distributed data and computing tools, including Spark, Databricks, Hadoop, Hive, AWS EMR, or Kafka
  • Experience working on real-time data and streaming applications
  • Experience with NoSQL implementation, including MongoDB or Cassandra
  • Experience with data warehousing using AWS Redshift, MySQL, or Snowflake
  • Experience with Agile engineering practices
  • Knowledge of concepts of Data Lakehouse architecture

Responsibilities

  • Design a cloud-based data solution to support the healthcare data repository.
  • Recommend standards, tools, and capabilities based on your research of the environment and knowledge.
  • Explore how to help customers overcome their most difficult challenges in cloud data storage.
  • Work with the client’s technical leads to implement strategy and architecture design.
  • Lead the planning, designing, and deployment of hybrid on-premises and cloud architecture solutions for use in a next-gen platform enabling data science and advanced analytics applications leveraging artifi cia l intelligence and machine learning, automation, cloud-based security, and big data.

Benefits

  • health, life, disability, financial, and retirement benefits, as well as paid leave, professional development, tuition assistance, work-life programs, and dependent care.
  • recognition awards program acknowledges employees for exceptional performance and superior demonstration of our values.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service