Kforce-posted about 1 month ago
Full-time • Mid Level
Greenwood Village, CO
1,001-5,000 employees
Administrative and Support Services

We are seeking a Data Engineer with strong expertise in SQL and distributed data processing to support large-scale batch data pipelines. This role involves maintaining and optimizing existing Spark-based data jobs, as well as developing new data flows to deliver high-volume mobile data tables for analytics and business use cases.

  • Enhance, maintain, and optimize existing Spark jobs that generate large-scale data tables
  • Develop new data processing flows to meet evolving business and customer requirements
  • Write and tune complex SQL queries for high-performance data extraction and transformation
  • Manage and execute batch data pipelines (non-real time) for mobile data processing
  • Collaborate with cross-functional teams to ensure data quality, accuracy, and reliability
  • Monitor, troubleshoot, and improve data workflows for performance and scalability
  • Leverage AWS EMR and related cloud services for distributed data computation
  • 5+ years of experience in data engineering or a related field
  • Strong expertise in SQL for large-scale data manipulation and optimization
  • Hands-on experience with Spark or Hadoop for distributed data processing
  • The ideal candidate has deep technical proficiency with Spark or Hadoop, and cloud-based data solutions such as AWS EMR
  • Working knowledge of AWS, preferably EMR or similar cloud environments
  • Proficiency with Python for scripting and automation
  • Strong understanding of data pipelines, batch processing, and performance tuning
  • Experience working with mobile data or telecommunications datasets
  • Familiarity with data architecture best practices and workflow orchestration tools
  • Solid problem-solving skills and a strong attention to detail in handling large, complex datasets
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service