Machine Learning System Software Architect

Baidu USA, Sunnyvale, CA
Onsite

About The Position

We are looking for a world-class Machine Learning System Software Architect to join our SoC team at Baidu’s Sunnyvale office. The successful candidate will be a motivated self-starter who thrives in a highly technical environment.

In this role, you will help the team architect and create high-performance machine learning system software and build the distributed AI training system by connecting thousands of Kunlun Accelerators and servers. You will create differentiated architectural innovations for Baidu’s Kunlun AI SoC roadmap and architect, simulate, and design machine learning solutions for our AI products. You will develop system-level ML architectures that push the boundaries of performance, power, and latency, and collaborate closely with teammates to ensure we design and optimize hardware and software for maximum performance. You will also monitor industrial and academic trends in artificial intelligence and determine where they should intersect our roadmaps; drive partnerships for access to the most advanced AI technologies; evaluate the power, performance, and cost of prospective architectures and subsystems; build scalable tools for modeling and performance evaluation; engage with system and application software engineers to ensure optimization of the entire hardware/software stack; and work with SoC design, verification, and validation engineers to execute the architecture.

Requirements

  • Knowledge of the machine learning market, technological and business trends, the software ecosystem, and emerging applications
  • Proven track record of 5+ years architecting software solutions for machine learning acceleration and optimization, especially in large-scale distributed training systems and HPC
  • Experience with deep learning frameworks: TensorFlow/PyTorch/PaddlePaddle, etc.
  • Strong track record of outreach to ML researchers and application developers
  • Experience with CPUs, GPUs, memory systems, and accelerators
  • Experience with performance simulation and modeling in C++
  • MS or PhD in Electrical or Computer Engineering
  • Excellent communication skills in both English and Chinese

Responsibilities

  • Architect and create high-performance machine learning system software
  • Build the distributed AI training system by connecting thousands of Kunlun Accelerators and servers
  • Create differentiated architectural innovations for Baidu’s Kunlun AI SoC roadmap
  • Architect, simulate, and design machine learning solutions for our AI products
  • Develop system-level ML architectures that push the boundaries of performance, power, and latency
  • Collaborate closely with teammates to ensure we design and optimize hardware and software for maximum performance
  • Monitor industrial and academic trends in artificial intelligence and determine where they should intersect our roadmaps
  • Drive partnerships for access to the most advanced AI technologies
  • Evaluate the power, performance, and cost of prospective architectures and subsystems
  • Build scalable tools for modeling and performance evaluation
  • Engage with system and application software engineers to ensure optimization of the entire hardware/software stack
  • Work with SoC design, verification, and validation engineers to execute the architecture

What This Job Offers

Job Type

Full-time

Career Level

Senior

Number of Employees

5,001-10,000 employees
