What you'll do... Position: Staff Data Engineer Job Location: 1375 Crossman Avenue, Sunnyvale, CA 94089 Duties: Design, build, and maintain scalable data pipelines and infrastructure for large-scale data processing and analytics using technologies such as Hadoop, Spark, distributed event store and stream-processing platform, in-memory databases, and other big data tools. Build large-scale distributed event streaming platforms such as Apache Kafka and Google Cloud Pub/Sub. Develop and deploy real-time data processing pipelines for near-real-time (NRT) streaming data. Design and implement data storage solutions using Data Lakes and NoSQL databases to support high-volume and high-velocity data processing. Develop and implement machine learning models to support predictive analytics and automation. Develop and deploy natural language processing (NLP) models using chatgpt and other Gen AI focused tools and platforms. Work with data scientists, analysts, and other stakeholders to understand data requirements and develop solutions that meet their needs. Develop and maintain data quality and governance processes to ensure data accuracy, completeness, and consistency across different systems and sources. Design and implement job scheduling and automation using scheduling tools. Optimize data processing workflows using managed services provided by cloud platforms. Identify and resolve performance bottlenecks, data quality issues, and other technical challenges that arise in large-scale data processing environments. Create and maintain documentation and best practices for data engineering processes and systems. Stay up-to-date with the latest trends and innovations in big data, cloud computing, and related technologies, and adapt these technologies to improve data processing and analytics capabilities. Build data models to support data visualization and analysis. Develop and maintain data pipelines to extract, transform, and load data from various sources into the data visualization tool. Build and maintain dashboards and reports to provide insights into business performance and trends. Develop and maintain data validation and testing procedures to ensure data accuracy.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level