GPU ML Architect

Samsung, San Jose, CA

About The Position

Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy: the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is applied to high-performance computing devices (mobile, automotive, and other custom market segments) used by millions of people around the world. Come build with us!

We are seeking a highly skilled GPU ML Architect to join our GPU Architecture hardware team. As a GPU ML Architect, you will design and develop innovative machine learning (ML) solutions for graphics processing units (GPUs). While knowledge of graphics or GPU architecture is not a requirement, a strong understanding of compute architectures and machine learning principles is essential. You will work closely with cross-functional teams to analyze workloads, model performance, and create new features for large vision models and image classification applications.

Our Team

We’re growing a team of talented individuals with diverse skill sets to build a technology roadmap and deliver market-leading GPU products. Our Xclipse GPU is the first mobile GPU with ray tracing technology, enabling console-level graphics on Samsung Galaxy smartphones. Being part of a unique, growing team at a well-established global company means you have room to explore, innovate, and grow by wearing different hats. Our GPU Architect team delivers whole-system, architecture-level design for Samsung’s current and next-generation mobile GPUs. Depending on the functional discipline, team members are responsible for anything from early-stage architectural exploration, research in graphics and machine learning, new feature definition, and modeling to microarchitecture design and programming work.

Requirements

  • 15+ years of experience with a Bachelor’s degree in Computer Science/Computer Engineering/relevant technical field, or 13+ years of experience with a Master’s degree, or 11+ years of experience with a PhD
  • Strong understanding of machine learning principles and architectures
  • Experience with compute architectures and performance optimization
  • Experience with workload analysis and modeling
  • Excellent programming skills in languages such as C++, Python, or similar
  • Familiarity with LLMs, LVMs, and image classification
  • Ability to write clear and concise technical specifications
  • Strong analytical and problem-solving skills

Nice To Haves

  • GPU architecture and graphics pipeline experience is a big plus
  • Knowledge of ML frameworks and tools (e.g., TensorFlow, PyTorch) preferred

Responsibilities

  • Design and develop innovative machine learning (ML) solutions for graphics processing units (GPUs)
  • Analyze workloads, model performance, and create new features for large vision models and image classification applications
  • Write technical specifications for new ML features and architectures, ensuring they integrate seamlessly with the graphics pipeline and meet performance, power, and functionality requirements
  • Work on improving middleware to enhance the overall performance and efficiency of ML workloads on GPUs
  • Create new hardware features and optimizations to improve power, performance, and area (PPA) on ML applications such as LLMs, LVMs, and image classification
  • Integrate with the graphics pipeline team to ensure that ML solutions are aligned with the overall graphics architecture
  • Optimize GPU architectures for ML workloads, ensuring maximum performance and power efficiency
  • Collaborate with engineers to analyze and model workloads, identifying performance bottlenecks and areas for optimization

Benefits

  • medical
  • dental
  • vision
  • life insurance
  • 401(k)
  • free onsite lunch
  • employee purchase program
  • tuition assistance (after 6 months)
  • paid time off
  • student loan program
  • wellness incentives
  • MBO bonus compensation
  • long-term incentive plan
  • relocation

What This Job Offers

  • Job Type: Full-time
  • Career Level: Mid Level
  • Number of Employees: 5,001-10,000