This role involves conducting in-depth research into the underlying hardware logic of various AI accelerators, evaluating power-efficiency and suitability of heterogeneous architectures for Large Language Model (LLM) inference and training. It also includes designing and optimizing high-performance operator libraries for large-scale cloud computing environments, resolving latency issues in hardware scheduling, memory management, and distributed communication. The engineer will define interconnect architecture, driving virtualization, standardized access, and efficient pooling of heterogeneous computing resources in the cloud. Additionally, the role requires monitoring global trends in semiconductors and accelerators, performing feasibility studies and experimental validation for implementing emerging technologies within cloud infrastructure. Tencent is a world-leading internet and technology company that develops innovative products and services to improve the quality of life for people around the world.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior