The Data Flow Engineer will be responsible for defining, designing, implementing, and maintaining complex data flows, primarily using Cloudera DataFlow (Apache NiFi). This role involves developing ingestion, transformation, routing, and egress pipelines, as well as building and optimizing real-time and near-real-time CDC pipelines. The engineer will integrate external systems, manage data schemas, and ensure reliable data delivery. Additionally, the position requires configuring and managing data governance and security using Apache Atlas and Apache Ranger, monitoring pipeline performance, and collaborating with stakeholders. The role also includes creating technical documentation and participating in system upgrades.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Education Level
Associate degree