The Big Data Lead will be responsible for implementing and managing big data processing workflows, including ETL pipelines and data transformations. This role involves ensuring data quality and integrity, troubleshooting PySpark applications, and integrating PySpark code with various frameworks. The lead will also be responsible for documenting code lineage and ensuring compliance with data security and privacy regulations. The position requires strong programming skills in Python and SQL, extensive experience with big data technologies like Spark, Hadoop, Hive, and Kafka, and an understanding of data warehousing concepts and DevOps practices. Excellent problem-solving, analytical, communication, and leadership abilities are essential.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Education Level
No Education Listed
Number of Employees
5,001-10,000 employees