This is a hybrid position and must work in-office 1-2 days/week. May telecommute 2-3 days/week within commuting distance of Wayne, PA office location to be able to attend meetings or work onsite when needed, often on short notice. Provide Data engineering and build data pipelines on the Data Lake/Hadoop platform (SQL/Impala, Java, Scala/Spark). Design microservices architecture and deploy on dockers in private and public clouds and containers in Kubernetes environment. Architect Machine Learning pipelines and optimize data architecture for consumption, utilization and analytics for data science, machine learning and statistical use cases. Contribute to the creation of data lake store strategy consistent with standards, principals, theories, and concepts to ensure rapid delivery. Work with data architects on logical data models and physical database designs optimized for performance, availability and reliability. Tune and optimize backend and frontend data operations. Oversee the development of continuous integration and continuous deployment methodologies through the implementation of Maven, Jenkins, Git, and Nexus to deploy the codebase. Mentor development team members and inform management of work activities and schedules. Assess new initiatives to determine work effort and estimate time to completion. Mentor and assist lower-level architects and business analysts. Create detailed documentation for deployment processes, ensuring smooth and efficient implementation. Stay updated with the latest technological trends and advancements in the industry. Requires supervisory responsibilities for data engineers or related positions.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior