Technology Resource Experts, LLC is looking for an experienced High-Performance Computing (HPC) Systems Engineer to support complex system design, integration, monitoring, and diagnostics by applying a deep understanding of both physical and logical system architectures.
Position Description
Maintain a comprehensive understanding of the system's end-to-end physical and logical architecture to effectively apply hardware modeling and diagnostics (HMD) monitoring tools.
Leverage HMD monitoring tools to identify, narrow, and triage system issues, directing detailed problems to the appropriate diagnosticians or vendors for resolution.
Develop deep expertise in the HMD product and monitoring architecture to identify gaps, inefficiencies, and opportunities to enhance diagnostic effectiveness.
Collaborate with developers, analysts, and monitoring tool owners to propose, design, and implement improvements to monitoring solutions, increasing system reliability and operational visibility.
Analyze system logs, metrics, and telemetry, primarily using Splunk, to determine root causes, understand system behavior, and identify anomalous conditions.
Interpret hardware and system performance data, including graphs and trends, to diagnose system behavior and inform troubleshooting activities.
Guide the development of Splunk dashboards, health indicators, and diagnostic scripts to monitor critical data flows, system performance, and failure signatures.
Review and evaluate relevant technical documentation; ask clarifying questions and build expertise in hardware design to support accurate and timely system diagnosis.
Provide recommendations for testing strategies and develop documentation of issue signatures to enable and accelerate diagnostics development.
Collaborate closely with diagnosis teams and external vendors to troubleshoot complex hardware and system-level issues.
Track issues through resolution using JIRA, validate fixes, and confirm that corrective actions resolve the underlying problems.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed