The Data Engineer will be writing code that moves data through a pipeline, fixing data pipeline issues, optimizing data systems, and collaborating with stakeholders, day to day. The Capital Software Project is building a data fabric system. The data fabric system includes a data pipeline that will be moving complex mix of data types and formats to a data lake and a series of databases. Allowing mission engineering analyst to conduct data analysis from authoritative sources of truth. The Data Engineer will be working with the Systems Engineering and DevSecOps teams to ensure that datasets are properly converted, transformed, routed and stored. Create ETL/ELT code to ingest, transform, and load data from a variety of sources into the data lake and scale. Primary data sources will be mission engineering data and physics-based simulation results. How you will contribute to our National Security and Defense mission: As a Data Engineer will be working with the Systems Engineering and DevSecOps teams to ensure that datasets are properly converted, transformed, routed and stored, as designed. You will also bring your experience in harvesting metadata from datasets. You’ll bring your experience in multi-source and cross-domain data integration. Day-to-day, you will be blending your software engineering, data architecture, and operational monitoring skills to: Build and Maintain Data Pipelines to: • Write or update ETL/ELT code that ingests data from APIs, databases, or files. • Implement transformations to clean, standardize, or enrich data. • Optimize pipeline performance and improve reliability. • Hands-on experience with AI and ML pipelines, MLOps workflows, and data processing methodologies. • Implement methods for data traceability and lineage. Monitor Pipelines and Data Quality to: • Check automated alerts and logs for failed jobs or data anomalies. • Troubleshoot issues such as schema changes, missing data, or performance bottlenecks. • Validate data accuracy and consistency across systems. Leverage your expertise in semantic modeling Collaborate with Engineering Teams to: • Meet with project teams to understand new data requirements. • Work with backend engineers or DevSecOps to align pipeline changes with system updates. • Support stakeholders by providing data extracts or explanations of data structure. Design and Improve Data Architecture • Plan new data models or schemas for analytics or application use case. • Incorporate metadata strategies tailored fit for the use case. • Help shape the data warehouse/lake structure. • Evaluate new tools or cloud services to improve scalability and maintainability. Write Code and Documentation • Build reusable application containers in Python, SQL, or other languages used in the stack. • Document pipeline logic, data contracts, and system diagrams. • Participate in code reviews to ensure quality and best practices.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior