Principal Software Engineer

Red HatRaleigh, NC

About The Position

The Red Hat Performance and Scale Organization is looking for an experienced Principal Software Engineer to join the Team. This strategic role will require you to demonstrate strong technical leadership, drive roadmap development, and take ownership of testing, measuring, and analyzing the performance and scalability of different Red Hat Products on the sovereign cloud project. What You Will Do Work closely with management, product owners, developers, and quality engineers to understand product requirements and architect test plans to meet the roadmap requirements of different Red Hat Products. Develop sophisticated tests that simulate user workloads through comprehensive end-to-end automation, leveraging custom-built and state-of-the-art open-source tools and frameworks. Collaborate closely with key stakeholders and senior management to ensure alignment of team resources and capacity with the incoming testing requirements, in line with product delivery timelines. Deep dive into performance issues with the intent of discovering their root causes in complex distributed systems. Design and develop monitoring and reporting tools for performance and scale tests and analysis. Document your research and results clearly and concisely, and communicate findings both internally and externally. Engage in upstream communities to help test performance and scale early and influence design and development decisions. Triage, debug, and root cause customer issues related to sovereign cloud project performance and scale. Present your work and findings at internal and external conferences.

Requirements

  • 7+ years of professional software engineering experience
  • Bachelor’s degree or higher in Computer Science, Engineering, or a related field (or equivalent experience)
  • Demonstrable experience, understanding, and passion for performance engineering.
  • Working knowledge of Kubernetes or OpenShift.
  • Strong programming, debugging, and profiling skills in Python and/or Golang.
  • Hands-on experience with performance measurement, analysis, and optimization.
  • Experience with distributed systems.
  • Very strong Linux system administration and system engineering skills.
  • Solid scripting skills, particularly with Bash, Python, or Ansible.
  • Experience working with public clouds like AWS, Azure, GCP, or IBM Cloud, as well as bare metal environments.
  • Experience analyzing and interpreting large volumes of test results and succinctly communicating findings through easy-to-understand graphs/charts.
  • Experience with collaborative software development methodologies, tools, and version control.
  • Knowledge of statistical analysis and experimental design techniques.
  • Excellent communication and interpersonal skills.
  • Ability to work independently and proactively seek collaboration.

Nice To Haves

  • Experience with container technologies like Podman or Docker, and familiarity with building container images.
  • Experience with system performance engineering and metrics collection tools like iostat, vmstat, sar, perf, and Prometheus.
  • Master’s Degree in Computer Science preferred.
  • Experience with monitoring and dashboarding tools like Prometheus and Grafana.
  • Experience testing different network adapters and hardware offload technologies.
  • A demonstrated history of contributing to open-source projects.
  • Presentation skills and public speaking abilities for conferences and demonstrations.

Responsibilities

  • Work closely with management, product owners, developers, and quality engineers to understand product requirements and architect test plans to meet the roadmap requirements of different Red Hat Products.
  • Develop sophisticated tests that simulate user workloads through comprehensive end-to-end automation, leveraging custom-built and state-of-the-art open-source tools and frameworks.
  • Collaborate closely with key stakeholders and senior management to ensure alignment of team resources and capacity with the incoming testing requirements, in line with product delivery timelines.
  • Deep dive into performance issues with the intent of discovering their root causes in complex distributed systems.
  • Design and develop monitoring and reporting tools for performance and scale tests and analysis.
  • Document your research and results clearly and concisely, and communicate findings both internally and externally.
  • Engage in upstream communities to help test performance and scale early and influence design and development decisions.
  • Triage, debug, and root cause customer issues related to sovereign cloud project performance and scale.
  • Present your work and findings at internal and external conferences.

Benefits

  • Comprehensive medical, dental, and vision coverage
  • Flexible Spending Account - healthcare and dependent care
  • Health Savings Account - high deductible medical plan
  • Retirement 401(k) with employer match
  • Paid time off and holidays
  • Paid parental leave plans for all new parents
  • Leave benefits including disability, paid family medical leave, and paid military leave
  • Additional benefits including employee stock purchase plan, family planning reimbursement, tuition reimbursement, transportation expense account, employee assistance program, and more!
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service