ML Engineer, I - Acceleration Team

Torc Robotics • Ann Arbor, MI

About The Position

At Torc, we have always believed that autonomous vehicle technology will transform how we travel, move freight, and do business. A leader in autonomous driving since 2007, Torc has spent over a decade commercializing our solutions with experienced partners. Now a part of the Daimler family, we are focused solely on developing software for automated trucks to transform how the world moves freight. Join us and catapult your career with the company that helped pioneer autonomous technology, and the first AV software company with the vision to partner directly with a truck manufacturer.

Meet the Team

The mission of the Acceleration Team is to deploy trained machine learning models on embedded hardware. This includes developing custom CUDA layers (usually guided by reference designs in Python) and implementing pre- and post-processing modules that convert raw data into model inputs and convert model outputs to usable signals. A major focus of the team is minimizing model inference latency by iteratively profiling and hand-tuning GPU kernels and C++ code.

Requirements

Bachelor’s or advanced degree in Computer, Electrical, or Software Engineering
Deep understanding of memory management in C++, error handling, compilers and debuggers on Linux
Understanding of the mechanisms for calling C/C++ functions from Python
Understanding of neural networks and machine learning
Strong math skills including linear algebra
Strong written and verbal technical communication skills
Positive, team-player mindset

Nice To Haves

CUDA experience
Experience with deep learning frameworks such as PyTorch or TensorFlow

Responsibilities

Develop modern C++ and CUDA code for AI inference, including data processing algorithms and custom neural network layers
Optimize C++ and CUDA code guided by timing measurements and profiling to minimize processing latency
Utilize existing third-party and internal frameworks, libraries and tools
Work closely with other engineers and domain experts in a collaborative environment
Write functional and performance tests and documentation
Deliver high-quality, unit-tested, production code suitable for deployment in embedded, safety-critical environments

Benefits

Our compensation reflects the cost of labor across several geographic markets.
Pay is based on a number of factors and may vary depending on job-related knowledge, skills, and experience.
Torc's total compensation package will also include our corporate bonus and stock option plan.
Depending on the position offered, sign-on payments, relocation, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits.
