Data Engineer

World Wide TechnologyMansfield, TX
6d$150,000 - $200,000

About The Position

World Wide Technology, LLC has an opportunity available for a Data Engineer to support our Government Services team in developing an AI model to evaluate data integrity and identify errors for evaluation and resolution. The ideal candidate will have strong skills in DevSecOps practices, with hands-on experience in infrastructure automation, configuration management, and CI/CD pipelines.

Requirements

  • Bachelor's degree in Computer Science or related field, or equivalent experience.
  • 3-5 years of experience in data engineering.
  • Strong proficiency in programming languages like Python for data processing pipeline development.
  • Experience with distributed data processing frameworks (e.g., Apache Spark) and SQL for handling large-scale datasets.
  • Proficiency in working with databases (SQL and NoSQL) and data storage systems (e.g., Oracle Database, Delta Lake).
  • Experience working with vector databases or similar technologies (e.g., Pinecone, FAISS).
  • Familiarity with orchestration tools (e.g., Apache Airflow, Prefect) for data workflows.
  • Understanding of data security, governance, and compliance best practices.
  • Experience with cloud platforms (e.g., AWS, Azure, GCP) and their data services (e.g., Azure AI Search, BigQuery, Redshift).
  • Proficiency with common DevOps practices, including CI/CD pipelines and containerization (Docker).
  • Strong conceptual problem-solving

Responsibilities

  • Design and construct scalable data pipelines and ETL processes for ingesting and transforming structured and unstructured data, using OCR, large language models, vision-language models, and reasoning agents
  • Build and optimize modern data storage cloud environment, including cloud-based data lakes and warehouses, to support advanced analytics and GenAI workflows.
  • Implement data integration workflows that prioritize performance, security, and scalability while ensuring data quality and governance.
  • Collaborate with data scientists and machine learning engineers to prepare high-quality datasets for model development, training, and inference.
  • Monitor and troubleshoot data pipeline performance, addressing bottlenecks and failures to ensure reliability and scalability.
  • Employ DevOps practices to set up CI/CD pipelines, automate testing, and ensure reliable deployment of data workflows and services.

Benefits

  • Health and Wellbeing: Health, Dental, and Vision Care, Onsite Health Centers, Employee Assistance Program, Wellness program
  • Financial Benefits: Competitive pay, Profit Sharing, 401k Plan with Company Matching, Life and Disability Insurance, Tuition Reimbursement
  • Paid Time Off: PTO and Sick Leave (starting at 20 days per year) & Holidays (10 per year), Parental Leave, Military Leave, Bereavement
  • Additional Perks: Nursing Mothers Benefits, Voluntary Legal, Pet Insurance, Employee Discount Program
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service