The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, and drive efficiency and innovation, fundamentally transforming their businesses and operations at scale. SambaNova Suite™ is the first full-stack generative AI platform, from chip to model, optimized for enterprise and government organizations. Powered by the intelligent SN40L chip, the SambaNova Suite is a fully integrated platform, delivered on-premises or in the cloud, combined with state-of-the-art open-source models that can be easily and securely fine-tuned using customer data for greater accuracy. Once a model has been adapted with customer data, the customer retains ownership of it in perpetuity, so they can turn generative AI into one of their most valuable assets.

The Opportunity

The Runtime team at SambaNova is a seasoned engineering team with a proven track record of delivering cutting-edge system software solutions for AI and machine learning applications in the enterprise and commercial landscape. Runtime is responsible for the lowest levels of the SambaNova stack, interacting efficiently with the hardware to provide the best application performance and maximize hardware utilization. We handle all aspects of software infrastructure to enable higher-level applications, including:

High-performance user libraries
Operating system interface and integration
Data model manipulation for scaling
Networking and communication, intra-node and inter-node
Orchestration of partitioned workloads
Error monitoring and tools for system management and observability

We build a high-performance, distributed, and scalable software execution environment for SambaNova DataScale and Cloud platforms to support data-flow applications such as ML training and inference, as well as HPC applications.
We are searching for a software engineer who will work on all parts of the runtime stack, supporting AI, ML, and scientific applications in high-performance distributed systems. You will participate in building, testing, and deploying next-generation high-performance compute systems for AI applications at scale. We expect the candidate to have a strong background in programming, in building and testing software for distributed systems, and in performance tuning of large-scale systems, as well as good teamwork and planning skills.
Job Type
Full-time
Career Level
Mid Level
Number of Employees
101-250 employees