Responsible for the development of high performance, distributed computing tasks using Big Data technologies such as Hadoop, NoSQL, text mining and other distributed environment technologies. Familiarity with JVM-based function languages including Scala and Clojure; Hadoop query languages including Pig, Hive, Scalding, Cascalog, PyCascading; along with alternative HDFS-based computing frameworks including Spark and STORM are desirable. Key Roles and Responsibilities: Uses Big Data programming languages and technology, writes code, completes programming and documentation, and performs testing and debugging of applications. Analyzes, designs, programs, debugs and modifies software enhancements and/or new products used in distributed, large-scale analytics and visualization solutions. Interacts with data scientists and industry experts to understand how data needs to be converted, loaded and presented. Works in a highly agile environment. Utilize AI/ML tooling and techniques to implement highly productive Data Science solutioning that spans supervised/unsupervised predictive analytics, neural networks, and deep learning. Utilize alternative HDFS-based computing frameworks including Python programming, and AI-Computer Vision. Utilize Image Processing (AI-Computer Vision) and Optical Character Recognition (OCR) model packages.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior