Sr CPU Performance Engineer - Display

QualcommMarkham, ON
CA$90,400 - CA$135,600

About The Position

Do you love working on state‑of‑the‑art technologies? Are you a visionary who enjoys solving complex technological and architectural challenges? At Qualcomm, our System Architecture team offers a collaborative, hands‑on environment that fosters engineering excellence, creativity, and innovation. In this highly visible role, you will be part of the System Architecture team driving a high‑performance, low‑power Display Processing Unit (DPU). The DPU is an integral component of Qualcomm Snapdragon platforms across multiple business units, including mobile, automotive, VR, and wearable. It is responsible for composing the final image displayed on the panel. The Display Co‑Processor (DCP) is a RISC‑V–based processor that offloads display‑related tasks from the main application processor. It serves as a dedicated engine that manages display timing, formatting, and coordination to ensure smooth, efficient, and power‑optimized display operation. New Position

Requirements

  • 2+ years of experience in modern CPU architecture and microarchitecture, with a focus on performance, power efficiency, and parallelism
  • 2+ years of programming experience in C/C++, and scripting languages such as Perl and/or Python
  • Strong understanding of CPU microarchitecture concepts such as pipelines, out‑of‑order execution, cache and memory hierarchies, and performance bottleneck analysis
  • Experience with CPU performance benchmarking, workload characterization, and model‑to‑RTL/silicon correlation
  • Strong analytical, critical‑thinking, and problem‑solving skills applied to complex architectural trade‑offs
  • Excellent written and verbal communication skills, with the ability to clearly present architectural insights and recommendations
  • Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 2+ years of Systems Engineering or related work experience.
  • Master's degree in Engineering, Information Systems, Computer Science, or related field and 1+ year of Systems Engineering or related work experience.
  • PhD in Engineering, Information Systems, Computer Science, or related field.

Responsibilities

  • Microarchitectural workload analysis: Analyze real‑world and synthetic workloads to identify IPC, latency, bandwidth, cache hierarchy, and QoS bottlenecks across the CPU pipeline
  • Pre‑silicon architectural modeling: Develop and maintain CPU architectural and microarchitectural performance models; evaluate design trade‑offs through what‑if studies (e.g., pipeline depth, execution resources, cache structures, memory latency tolerance)
  • RTL and silicon correlation: Correlate architectural models with RTL and post‑silicon measurements to validate assumptions and guide design refinements
  • System‑level performance analysis: Assess CPU interactions with NoC, memory subsystem, QoS/MPAM mechanisms, and power‑management features, and quantify their impact on performance and efficiency
  • Cross‑disciplinary collaboration: Work closely with CPU architects, RTL design and verification teams, SoC performance teams, and software partners to drive informed architecture decisions

Benefits

  • competitive annual discretionary bonus program
  • opportunity for annual RSU grants
  • highly competitive benefits package
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service