Design, develop, and maintain scalable and efficient data pipelines utilizing advanced technologies such as Python, PySpark, and Databricks. Integrate data from diverse sources while ensuring high standards of data quality, consistency, and reliability. Formulate and implement comprehensive data architecture strategies, encompass data modeling, schema design, and data storage solutions, as well as optimizing data processing workflows for enhanced performance, scalability, and cost-efficiency. Collaborate with data scientists, analysts, and stakeholders is essential to accurately understand data requirements and deliver tailored data solutions. Identify and resolve data-related issues, support data infrastructure, and maintain detailed documentation of all data pipelines, architecture, and processes. Drive the design and implementation of data models to enhance business decision-making by generating insights from both internal and external data assets. Define data requirements, mine and validate large-scale structured and unstructured datasets using cloud-based tools and supporting both standard and customized data analyses. Develop robust mechanisms for data ingestion, analysis, validation, normalization, and cleaning alongside upholding best practices in data engineering and contributing to advanced data analytics and visualization initiatives.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior